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POLYPEPTIDES AND POLYNUCLEOTIDES USEFUL FOR THE 
DIAGNOSIS AND TREATMENT OF PATHOGENIC NEISSERIA 



Technical Field 

The invention relates to materials and 
methodologies for managing the spread of infections 
caused by microorganisms having type 4 pilin, for 
example, .Neisseria, More specifically, it relates to 
polypeptides and antibodies useful in vaccines for the 
treatment of pathologic infections caused by these 
microorganisms. It also relates to polynucleotides 
useful for the recombinant production of these 
polypeptides. In addition, it. relates to polypeptides, 
antibodies, and polynucleotides used for the detection of 
these strains. 

Background Art 

Type 4 pilins are expressed by several 
bacterial genuses, including Neisseria, Moraxella, 
Bacteroides , and Pseudomonas . Species within these 
genuses which have pathogenic members that express type 4 
pilins are, for example, JJ. gonorrhoeae, N. meningitidis, 
Af. bovis, B. nodosus, and P. aeruginosa. In addition, 
the Tcp pilin of V. cholerae is highly homologous to the 
type 4 pilins of other genuses. 
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man. 



The only knovm reservoir of the neisseriae is 
The genus includes two gram- negative species of 
pyogenic cocci that are pathogenic for man: the 
meningococcus {Neisseria meningitidis) and the gonococcus 

5 (Neisseria gonorrheae) . 

N. Meningitidis causes a variety of infections, 
most notably, meningitis and bacteremia. Meningococci 
can be divided into serologic groups on the basis of 
agglutination reactions with immune serum. The present 
10 classification includes groups A through Z. Clinically 
significant new groups encompass Y and W 135. The major 
groups are remarkably heterogeneous, but subclass! - 
f ication with additional serologic markers has been 
possible. Noncapsular antigens have provided the basis 
15 for dividing strains of groups into distinct types. 

Meningococci cause either epidemic or sporadic 
disease, and historically, there has been a cyclic 
variation in the prevalence of meningococcal infection 
with peaks of increased frequency occurring every 8 to 12 
20 years and lasting 4 to € years. The attack rate of 

meningococcal disease is highest for children between 6 
months and l year. In the first half of this century, 
most epidemics of meningococcal disease in the United 
States were caused by group A organisms. In the past two 
25 decades, first group B then group C meningococci were 
responsible for outbreaks in both the military and 
civilian populations. Currently, group B is responsible 
for 50 to 55 percent of reported cases. 

Gonorrhea, which is caused by N. gonorrhea, is 
an infection of columnar and transitional epithelium. 
This disease is the most common reportable communicable 
disease in the United States, and also has world-wide 
prevalence - 
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Although treatment of disease caused by 
gonococci and meningococci are often treated with 
antibiotics, these microorganisms often develop 
antibiotic resistance. Thus, prevention with vaccines is 
5 a preferable mode to contain the spread of infection. 
However, for a variety of reasons, including antigenic 
variation, the development of vaccines has been greatly 
hampered. For example, a vaccine which prevents 
gonorrhea is still lacking. In addition, although 56% of 

10 the causes of meningococcal disease are caused by 

serogroup B, an effective vaccine against this serogroup 
is also lacking. 

N. gonorrheae and N. meningitidis are organisms 
completely adapted to the human host, having no other 

15 ecological niche. They have acquired a large arsenal of 
strategies to overcome the human host defense system. 

The first step in infection with pathological 
forms of these Neisseria is adherence to target cells. 
It is thought that the pili of these microorganisms are a 

20 major virulence factor. For example, it is known that in 
the case of N. gonorrheae, piliated (P + ) variants attach 
much better to susceptible cells than non-piliated (P-) 
variants (Swanson, 1973; Pearce and Buchanan, 1978). 
Moreover, P+ variants, unlike P- variants, are able to 

25 establish an infection in human volunteers (Kellog et al, 
1968) . 

Although the pilus protein elicits an immune 
response, so many antigenic variants exist and continue 
to develop that vaccines against the pilus protein are 
30 not highly effective. 

Pilin is the major subunit of the pilus. 
Expression of pilin is controlled at the pilE locus. 
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Brief Description of th e Invention 

We have isolated and characterized a novel 
protein of pathogenic forms of Neisseria, PilC, that may 
be is associated with the pili of gonococci and 
5 meningococci. We have also isolated and characterized 
genes which encode PilC, i.e., the pilC loci. 

Portions of the DNA sequences of the pilC genes 
are useful as probes to diagnose the presence of the 
relevant Neisseria in samples. These DNAs also make 

10 available polypeptide sequences of immunoreactive 

epitopes encoded within the loci, thus permitting the 
production of polypeptides which are useful as standards 
or reagents in diagnostic tests and/or as components of 
vaccines for microorganisms with type 4 pilin and 

15 containing one or more epitopes that are immunologically 
identifiable with an epitope encoded in pilC of 
Neisseria. Antibodies, both monoclonal and purified 
polyclonal, directed against PilC epitopes are also 
useful for diagnostic tests and as therapeutic agents for 

20 passive immunization. In addition, by utilizing probes 

derived from the DNA sequences, it is possible to isolate 
and sequence portions of the pilC loci from species and 

strains of interest. 

Accordingly, one embodiment of the invention is 
25 a recombinant polynucleotide encoding a polypeptide 

comprised of an immunoreactive epitope of a protein 

encoded in pilC of .Neisseria. 

Another embodiment of the invention is a 

recombinant expression system comprising a polynucleotide 
30 encoding a polypeptide comprised of an immunoreactive 

epitope of a protein encoded in pilC of Neisseria, 

wherein the polynucleotide is operably linked to a 

control sequence compatible with a desired host. 
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Yet another embodiment of the invention is 
purified polypeptide comprised of an immunoreactive 
epitope of a protein encoded in pilC of Neisseria.. 

Another embodiment of the invention is a 
5 recombinant polypeptide comprised of an immunoreactive 
epitope of a protein encoded in pile of Neisseria. 

Still another embodiment of the invention is a 
vaccine composition for the treatment of Neisseria 
infection, comprised of a pharmaceutically acceptable 
10 excipient and of an effective amount of a recombinant 

polypeptide, wherein the polypeptide is comprised of an 
immunoreactive epitope of a protein encoded in pile of 
Neisseria. 

Yet another embodiment of the invention is a 
15 composition comprised of purified polyclonal anti-PilC 
antibodies, wherein the PilC is of Neisseria. 

An additional embodiment of the invention is a 
composition comprised of a monoclonal antibody directed 
against an immunoreactive epitope encoded in pile of 
20 Neisseria. 

Another embodiment of the invention is a method 
for producing antibodies to PilC of Neisseria comprising 
administering to an individual a composition comprised of 
an isolated immunogenic polypeptide containing a PilC 
25 epitope in an amount sufficient to produce an immune 
response. 

Yet another embodiment of the invention is an 
oligomer capable of hybridizing to a sequence in pile of 
Neisseria, wherein the oligomer is comprised of a pile 
30 sequence complementary to at least about 6 contiguous 
nucleotides of pile. 

Still another embodiment of the invention is a 
process for detecting a pilC sequence in an analyte 
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strand, wherein the pilC sequence comprises a selected 
target region, the process comprising: 

(a) providing a sample comprised of an analyte 
strand suspected of containing a selected target pile 

5 sequence; 

(b) providing an oligomer capable of 
hybridizing to the target pile sequence, wherein the 
oligomer is comprised of a pilC targeting sequence 
complementary to at least about 6 contiguous nucleotides 

10 of pllC; 

(c) incubating the sample of (a) with the 
oligomer of (b) under conditions which allow specific 
hybrid duplexes to form between the targeting sequence 
and the target sequence; and 

15 (d) detecting hybrids formed between the 

target sequence, if any, and the oligomer. 

Yet another embodiment of the invention is a 
recombinant polynucleotide comprising a DNA sequence of 
at least 8 contiguous nucleotides from pllC, wherein the 

20 pllC sequence is selected from the group of sequences 
shown in Figure 3, Figure 6, and Figure 7. 

Another embodiment of the invention is a method 
of treating an individual for a Neisseria infection 
comprising administering to the individual antibodies 

25 produced according to claim 31, wherein the antibodies 
are administered in an amount effective to prevent the 
pathology of the infection. 

Brief Description of the Drawings 
30 Figure 1 is a genetic and physical map of pilC 

locus 1, showing the restriction enzyme sites. 

Figure 2 shows the nucleotide sequence and the 
deduced amino acid sequence of the 5' -end of pilCl. 



BNSDOOD: <WO_J213871A1J_j» 



WO 92/13871 




PCT/US92/00863 



-7- 

Figure 3 shows the nucleotide sequence of the 
sense strand of the pilCl gene. 

Figure 4 shows the shows the nucleotide 
sequence of the sense strand of the pilCl gene and the 
5 amino acids encoded therein. 

Figure 5 shows the nucleotide sequence of the 
sense strand of the pilCl gene, and the effect of frame 
shift on the putative gene products encoded therein. 

Figure 6 shows the nucleotide sequences of PCR 
10 amplified fragments demonstrating a variation in the 

length of the G tract and sequence differences in the 5' 
region of the pllC genes. 

Figure 7 shows the DNA sequence of the 3 '-end 
of the pilC2 fragment. 
15 Figure 8 shows the p±lC2 fragment sequence, and 

the putative amino acids encoded therein. 

Figure 9 shows a comparison of the analogous 
portions of pilC2 (top) and pilCl (bottom) DNA sequences, 
and the putative amino acids encoded therein. 

20 

Modes for Carrying Out the Invention 

The present invention provides polypeptides, 
antibodies, and polynucleotides which are useful for the 
detection and treatment of pathogenic microorganisms 

25 having type 4 pilin, for example, Neisseria, Moraxella, 
Bacteroides, and Pseudomonas . 

We have discovered a polypeptide, Pile, which 
is present in N. gonorrheae. This polypeptide is a 110 
kd protein that is closely associated with the pili of 

30 the microorganism. Most strains of N. gonorrheae carry 
two copies of the corresponding genes which encode the 
polypeptide (s) ; these genes have been denoted pilC. 
Expression from the pilC loci is regulated by frequent 
frameshift mutations within a run of G residues in the 
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region encoding the signal peptide. The two pilC genes 
of N. gonorrheae are not identical. Hence, alternate 
expression from either the pllCl or the pilC2 loci gives 
rise to two different forms of PilC. Among nonpiliated 
5 (P") descendants from P + clones, clones were found that 
expressed pilin but not PilC. All P + revertants from 
such PilC" non-piliated clones have regained expression 
of PilC. Hence, phase variation of gonococcal pili can 
be caused by frameshift mutations in pilC. Transposon 
10 inactivation of the expressed pilC2 copy resulted in a 
nonpiliated, pilin producing revertible phenotype. It 
appears, therefore, that PilC is required for assembly of 
pilin subunits into a polymerized pilus fiber in N. 
gonorrheae . 

15 we have cloned and isolated gene, pilCl, from 

N. gonorrheae. In addition, by comparison of this gene 
sequence with a related sequence, we have cloned a 
fragment of the p±12 gene. Moreover, using 
polynucleotide probes derived from isolated pllCl and PCR 

20 amplification, we have detected two possible variants of 
a pilC gene in N. meningitidis. The sequences of pilC 
reported herein appear to be novel, in that there are no 
reported counterparts in Genbank, and no significant 
homologies were found with any available sequences in 

25 that data base. 

The useful materials and processes of the 
present invention are made possible by the provision of 
the sequences of the pilC genes from N. gonorrheae and 
from N. meningitidis. Information present in the 
sequences of the pilC genes allows for the design of 
polypeptides which may be useful as vaccines for 
treatment of pathogenic Neisseria, as diagnostic tools 
for the detection of these microorganisms, and as agents 
for the preparation of antibodies to PilC. In addition, 
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this information allows for the design of polynucleotides 
for the recombinant production of the polypeptides 
derived from Pile, and for the design of oligomers which 
are useful as probes and primers for the detection and 
5 amplification of target regions of pllC. 

The practice of the present invention will 
employ, unless otherwise indicated, conventional 
techniques of molecular biology, microbiology, 
recombinant DNA, and immunology, which are within the 

10 skill of the art. Such techniques are explained fully in 
the literature. See e.g., Maniatis, Fitsch & Sambrook, 
MOLECULAR CLONING; A LABORATORY MANUAL, Second Edition 
(1989); DNA CLONING, VOLUMES I AND II (D.N Glover ed. 
1985); OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed, 1984); 

15 NUCLEIC ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins 

eds. 1984); TRANSCRIPTION AND TRANSLATION (B.D. Hames & 
S.J. Higgins eds. 1984); ANIMAL CELL CULTURE (R.I. 
Freshney ed. 1986); IMMOBILIZED CELLS AND ENZYMES (IRL 
Press, 1986); B. Perbal, A PRACTICAL GUIDE TO MOLECULAR 

20 CLONING (1984); the series, METHODS IN ENZYMOLOGY 
(Academic Press, Inc.); GENE TRANSFER VECTORS FOR 
MAMMALIAN CELLS (J.H. Miller and M.P. Calos eds. 1987, 
Cold Spring Harbor Laboratory) , Methods in Enzymology 
Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, eds., 

25 respectively), Mayer and Walker, eds. (1987), 

IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY 
(Academic Press, London) , Scopes, (1987) , PROTEIN 
PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition 
( Springer -Ver lag, N.Y.), and HANDBOOK OF EXPERIMENTAL 

30 IMMUNOLOGY , VOLUMES I-IV (D.M. Weir and C. C. Blackwell 
eds 1986) . All patents, patent applications, and 
publications mentioned herein, both supra and infra, are 
hereby incorporated herein by reference. 
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As used herein, a polynucleotide "derived from" 
a designated sequence refers to a polynucleotide sequence 
which is comprised of a sequence of approximately at 
least about 6 nucleotides, preferably at least about 8 
5 nucleotides, more preferably at least about 10-12 

nucleotides, and even more preferably at least about 
15-20 nucleotides corresponding to a region of the 
designated nucleotide sequence. "Corresponding" means 
homologous to or complementary to the designated 
10 sequence. Regions from which typical polynucleotide 

sequences may be "derived" include but are not limited 
to, for example, regions encoding specific epitopes, as 
well as non- transcribed and/or non- translated regions. 

The derived polynucleotide is not necessarily 
15 physically derived from the nucleotide sequence shown, 
but may be generated in any manner, including for 
example, chemical synthesis or DNA replication or reverse 
transcription or transcription. In addition, 
combinations of regions corresponding to that of the 
20 designated sequence may be modified in ways known in the 
art to be consistent with an intended use. 

Similarly, a polypeptide or amino acid sequence 
"derived from" a designated nucleic acid sequence refers 
to a polypeptide having an amino acid sequence identical 
25 to that of a polypeptide encoded in the sequence, or a 

portion thereof wherein the portion consists of at least 
3-5 amino acids, and more preferably at least 8-10 amino 
acids, and even more preferably at least 11-15 amino 
acids, or which is immunologically identifiable with a 
polypeptide encoded in the sequence. This terminology 
also includes a polypeptide expressed from a designated 
nucleic acid sequence. 

A recombinant or derived polypeptide is not 
necessarily translated from a designated nucleic acid 
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sequence. It may be generated in any manner, including 
for example, chemical synthesis, or expression of a 
recombinant expression system, or isolation from a 
microorganism. A recombinant or derived polypeptide may 
5 include one or more analogs of amino acids or unnatural 
amino acids in its sequence. Methods of inserting 
analogs of amino acids into a sequence are known in the 
art. It also may include one or more labels, which are 
known to those of skill in the art. 

10 The term "recombinant polynucleotide" as used 

herein intends a polynucleotide of genomic, cDNA, 
semisynthetic, or synthetic origin which, by virtue of 
its origin or manipulation: (1) is not associated with 
all or a portion of a polynucleotide with which it is 

15 associated in nature, (2) is linked to a polynucleotide 

other than that to which it is linked in nature, (3) does 
not occur in nature, or (4) is not in the form of a 
library. 

The term "polynucleotide" as used herein refers 
20 to a polymeric form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleo tides . This term 
refers only to the primary structure of the molecule. 
Thus, this term includes double- and single- stranded DNA 
and RNA. It also includes known types of modifications, 
25 for example, labels which are known in the art, 

methylation, "caps", substitution of one or more of the 
naturally occurring nucleotides with an analog, 
internucleotide modifications such as, for example, those 
with uncharged linkages (e.g., methyl phosphonates, 
30 phospho tries ters, phosphoamidates , carbamates, etc.) and 
with charged linkages (e.g., phosphorothioates , 
phosphorodithioates , etc . ) , those containing pendant 
moieties, such as, for example proteins (including for 
e.g., nucleases, toxins, antibodies, signal peptides, 
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poly-L- lysine, etc.), those with intercalators (e.g., 
acridine, psoralen, etc.), those containing chelators 
(e.g., metals, radioactive metals, boron, oxidative 
metals, etc.), those containing alkylators, those with 
5 modified linkages (e.g., alpha anomeric nucleic acids, 

etc.), as well as unmodified forms of the polynucleotide. 

The term "purified polynucleotide" refers to a 
polynucleotide which is essentially free, i.e., contains 
less than about 50%, preferably less than about 70%, and 

10 even more preferably less than about 90% of polypeptides 
with which the polynucleotide is naturally associated. 
Techniques for purifying polynucleotides from bacteria 
are known in the art, and include for example, disruption 
of the bacteria with a chaotropic agent, differential 

15 extraction and separation of the polynucleotide (s) and 
polypeptides by ion- exchange chromatography, affinity 
chromatography, and sedimentation according to density. 

The term "purified polypeptide" refers to a 
polypeptide or fragment thereof which is essentially 

20 free, i.e., contains less than about 50%, preferably less 
than about 70%, and even more preferably less than about 
90%, of cellular components with which the polypeptide is 
naturally associated. Techniques for purifying 
polypeptides are known in the art, and examples of these 

25 techniques are discussed infra. 

"Recombinant host cells", "host cells", 
"cells", "cell lines", "cell cultures", and other such 
terms denoting microorganisms or higher eukaryotic cell 
lines cultured as unicellular entities refer to cells 

30 which can be, or have been, used as recipients for 

recombinant vector or other transfer DNA, and include the 
progeny of the original cell which has been trans fected. 
It is understood that the progeny of a single parental 
cell may not necessarily be completely identical in 
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morphology or in genomic or total DNA complement as the 
original parent, due to natural, accidental, or 
deliberate mutation, 

A "replicon" is any genetic element, e.g., a 
5 plasmid, a chromosome, a virus, a cosmid, etc. that 

behaves as an autonomous unit of polynucleotide replica- 
tion within a cell; i.e., capable of replication under 
its own control . 

A "vector" is a replicon in which another 
10 polynucleotide segment is attached, so as to bring about 
the replication and/or expression of the attached 
segment . 

"Control sequence" refers to polynucleotide 
sequences which are necessary to effect the expression of 

15 coding sequences to which they are ligated. The nature 

of such control sequences differs depending upon the host 
organism; in prokaryotes, such control sequences 
generally include promoter, ribosomal binding site, and 
terminators; in eukaryotes, generally, such control 

20 sequences include promoters, terminators and, in some 
instances, enhancers. The term "control sequences" is 
intended to include, at a minimum, all components whose 
presence is necessary for expression, and may also 
include additional components whose presence is 

25 advantageous, for example, leader sequences . 

"Operably linked" refers to a juxtaposition 
wherein the components so described are in a relationship 
permitting them to function in their intended manner. A 
control sequence "operably linked" to a coding sequence 

30 is ligated in such a way that expression of the coding 

sequence is achieved under conditions compatible with the 
control sequences . 

An "open reading frame" (ORF) is a region of a 
polynucleotide sequence which encodes a polypeptide; this 

35 
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region may represent a portion of a coding sequence or a 
total coding sequence. 

A "coding sequence" is a polynucleotide 
sequence which is transcribed into mRNA and/or translated 
5 into a polypeptide when placed under the control of 

appropriate regulatory sequences. The boundaries of the 
coding sequence are determined by a translation start 
codon at the 5' -terminus and a translation stop codon at 
the 3' -terminus. A coding sequence can include, but is 
10 not limited to mRNA, cDNA, and recombinant polynucleotide 
sequences . 

"Immunologically identifiable with/ as" refers 
to the presence of epitope (s) and polypeptides (s) which 
are also present in the designated polypeptide (s) . 

15 Immunological identity may be determined by antibody 

binding and/or competition in binding; these techniques 
are known to those of average skill in the art, and are 
also illustrated infra. 

As used herein, "epitope" refers to an 

20 antigenic determinant of a polypeptide. An epitope could 
comprise 3 amino acids in a spatial conformation which is 
unique to the epitope. Generally an epitope consists of 
at least 5 such amino acids, and more usually, consists 
of at least 8-10 such amino acids. Methods of 

25 determining the spatial conformation of amino acids are 
known in the art, and include, for example, x-ray 
crystallography and 2 -dimensional nuclear magnetic 
resonance . 

A polypeptide is "immunoreactive" when it is 
30 "immunologically reactive" with an antibody, i.e., when 
it binds to an antibody due to antibody recognition of a 
specific epitope contained within the polypeptide. 
Immunological reactivity may be determined by antibody 
binding, more particularly by the kinetics of antibody 
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binding, and/or by competition in binding using as 
competitor (s) a known polypeptide (s) containing an 
epitope against which the antibody is directed. The 
techniques for determining whether a polypeptide is 
5 immunologically reactive with an antibody are known in 
the art. An "iramunoreactive" polypeptide may also be 
"immunogenic". As used herein, the term "immunogenic 
polypeptide" is a polypeptide that elicits a cellular 
and/or humoral immune response, whether alone or linked 

10 to a carrier in the presence or absence of an adjuvant. 

As used herein, the term "antibody" refers to a 
polypeptide or group of polypeptides which are comprised 
of at least one antibody combining site. An "antibody 
combining site" or "binding domain" is formed from the 

15 folding of variable domains of an antibody molecule (s) to 
form three-dimensional binding spaces with an internal 
surface shape and charge distribution complementary to 
the features of an epitope of an antigen, which allows an 
immunological reaction with the antigen. An antibody 

20 combining site may be formed from a heavy and/or a light 
chain domain (VH and VL, respectively) , which form 
hypervariable loops which contribute to antigen binding. 
The term "antibody" includes, for example, vertebrate 
antibodies, hybrid antibodies, chimeric antibodies, 

25 altered antibodies, univalent antibodies, the Fab 
proteins, and single domain antibodies. 

As used herein, a "single domain antibody" 
(dAb) is an antibody which is comprised of an VH domain, 
which reacts immunologically with a designated antigen. 

30 A dAB does not contain a VL domain, but may contain other 
antigen binding domains known to exist in antibodies, for 
example, the kappa and lambda domains. Methods for 
preparing dABs are known in the art. See, for example, 
Ward et al. (1989) . 
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Antibodies may also be comprised of VH and VL 
domains, as well as other known antigen binding domains. 
Examples of these types of antibodies and methods for 
their preparation are known in the art (see, e.g., U.S. 
5 Patent No. 4,816,467, which is incorporated herein by 
reference) , and include the following. For example, 
"vertebrate antibodies" refers to antibodies which are 
tetramers or aggregates thereof, comprising light and 
heavy chains which are usually aggregated in a "Y" 

10 configuration and which may or may not have covalent 

linkages between the chains. In vertebrate antibodies, 
the amino acid sequences of all the chains of a 
particular antibody are homologous with the chains found 
in one antibody produced by the lymphocyte which produces 

15 that antibody in situ, or in vitro (for example, in 

hybridomas) . Vertebrate antibodies typically include 
native antibodies, for example, purified polyclonal 
antibodies and monoclonal antibodies. Examples of the 
methods for the preparation of these antibodies are 

20 described infra. 

"Hybrid antibodies" are antibodies wherein one 
pair of heavy and light chains is homologous to those in 
a first antibody, while the other pair of heavy and light 
chains is homologous to those in a different second anti- 

25 body. Typically, each of these two pairs will bind 

different epitopes, particularly on different antigens. 
This results in the property of "divalence", i.e., the 
ability to bind two antigens simultaneously. Such 
hybrids may also be formed using chimeric chains, as set 

30 forth below. 

"Chimeric antibodies", are antibodies in which 
the heavy and/ or light chains are fusion proteins. 
Typically the constant domain of the chains is from one 
particular species and/or class, and the variable domains 
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are from a different species and/or class. Also included 
is any antibody in which either or both of the heavy or 
light chains are composed of combinations of sequences 
mimicking the sequences in antibodies of different 
5 sources, whether these sources be differing classes, or 
different species of origin, and whether or not the 
fusion point is at the variable/ constant boundary. Thus, 
it is possible to produce antibodies in which neither the 
constant nor the variable region mimic known antibody 

10 sequences. It then becomes possible, for example, to 

construct antibodies whose variable region has a higher 
specific affinity for a particular antigen, or whose 
constant region can elicit enhanced complement fixation, 
or to make other improvements in properties possessed by 

15 a particular constant region. 

Another example is "altered antibodies", which 
refers to antibodies in which the naturally occurring 
amino acid sequence in a vertebrate antibody has been 
varied- Utilizing recombinant DNA techniques, antibodies 

20 can be redesigned to obtain desired characteristics. The 
possible variations are many, and range from the changing 
of one or more amino acids to the complete redesign of a 
region, for example, the constant region. Changes in the 
constant region, in general, to attain desired cellular 

25 process characteristics, e.g., changes in complement 

fixation, interaction with membranes , and other effector 
functions. Changes in the variable region may be made to 
alter antigen binding characteristics. The antibody may 
also be engineered to aid the specific delivery of a 

3 0 molecule or substance to a specific cell or tissue site. 
The desired alterations may be made by known techniques 
in molecular biology, e.g., recombinant techniques, site 
directed mutagenesis, etc. 
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Yet another example are "univalent antibodies " , 
which are aggregates comprised of a heavy chain/light 
chain dimer bound to the Fc (i.e., constant) region of a 
second heavy chain. This type of antibody escapes 
5 antigenic modulation. See, e.g., Glennie et al. (1982). 

Included also within the definition of antibod- 
ies are n Fab" fragments of antibodies. The "Fab" region 
refers to those portions of the heavy and light chains 
which are roughly equivalent, or analogous, to the 

10 sequences which comprise the branch portion of the heavy 
and light chains, and which have been shown to exhibit 
immunological binding to a specified antigen, but which 
lack the effector Fc portion . "Fab" includes aggregates 
of one heavy and one light chain (commonly known as 

15 Fab'), as well as tetramers containing the 2H and 2L 
chains (referred to as F(ab) 2 ), which are capable of 
selectively reacting with a designated antigen or antigen 
family. "Fab" antibodies may be- divided into subsets 
analogous to those described above, i.e, "vertebrate 

20 Fab", "hybrid Fab", "chimeric Fab", and "altered Fab". 
Methods of producing "Fab" fragments of antibodies are 
known within the art and include, for example, 
proteolysis, and synthesis by recombinant techniques. 

The term "polypeptide" refers to a polymer of 

25 amino acids and does not refer to a specific length of 

the product; thus, peptides, oligopeptides, and proteins 
are included within the definition of polypeptide. This 
term also does not refer to or exclude post -expression 
modifications of the polypeptide, for example, 

30 glycosylations , acetylations, phosphorylations and the 
like. Included within the definition are, for example, 
polypeptides containing one or more analogs of an amino 
acid (including, for example, unnatural amino acids, 
etc-), polypeptides with substituted linkages, as well as 
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other modifications known in the art, both naturally 
occurring and non- naturally occurring. 

"Transformation 11 , as used herein, refers to the 
insertion of an exogenous polynucleotide into a host 
5 cell, irrespective of the method used for the insertion, 
for example, direct uptake, transduction, f -mating or 
elect roporat ion* The exogenous polynucleotide may be 
maintained as a non- integrated vector, for example, a 
plasmid, or alternatively, may be integrated into the 

10 host genome. 

"Treatment" as used herein refers to 
prophylaxis and/or therapy. 

By "immunogenic" is meant an agent used to 
stimulate the immune system of a living organism, so that 

15 one or more functions of the immune system are increased 
and directed towards the immunogenic agent. Immunogenic 
agents include vaccines . Immunogenic agents can be used 
in the production of antibodies, both isolated polyclonal 
antibodies and monoclonal antibodies, using techniques 

20 known in the art. 

By vaccine is meant an agent used to stimulate 
the immune system of a living organism so that protection 
against or amelioration of future harm is provided. 
Immunization refers to the process of inducing an 

25 increased level of antibodies and/or cellular immune 

response in which T- lymphocytes respond by killing the 
pathogen and/or activate other cells involved in the 
immune response pathway. The antibodies produced as a 
result of immunization may belong to any of the 

30 immunological classes, such as immunoglobulins A, D, E, G 
or M. 

An "individual", as used herein, refers to 
vertebrates, particularly members of the mammalian 
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species, and includes but is not limited to domestic 
animals, sports animals, and primates, including humans. 

As used herein, the "sense strand" of a nucleic 
acid contains the sequence that has sequence homology to 
5 that of mRNA. The "anti- sense strand" contains a 

sequence which is complementary to that of the "sense 
strand" . 

As used herein, the term "probe" refers to a 
polynucleotide which forms a hybrid structure with a 

10 sequence in a target region, due to complementarity of at 
least one sequence in the probe with a sequence in the 
target region. The polynucleotide regions of probes may 
be composed of DNA, and/or RNA, and/or synthetic 
nucleotide analogs. 

15 as used herein, the term "target region" refers 

to a region of the nucleic acid which is to be amplified 
and/or detected. The term "target sequence refers to a 
sequence with which a probe or primer will form a stable 
hybrid under desired conditions. 

20 The term "primer" as used herein refers to an 

oligomer which is capable of acting as a point of 
initiation of synthesis of a polynucleotide strand when 
placed under appropriate conditions. The primer will be 
completely or substantially complementary to a region of 

25 the polynucleotide strand to be copied. Thus, under 

conditions conducive to hybridization, the primer will 
anneal to the complementary region of the analyte strand. 
Upon addition of suitable reactants, (e.g., a polymerase, 
nucleotide triphosphates, and the like) , the primer is 

30 extended by the polymerizing agent to form a copy of the 
analyte strand. The primer may be single- stranded, or 
alternatively may be partially or fully double-stranded. 

The terms "analyte polynucleotide" and 1 analyte 
strand" refer to a single- or double- stranded nucleic 
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acid molecule which is suspected of containing a target 
sequence, and which may be present in a biological 
sample* 

As used herein, a "biological sample" refers to 
5 a sample of tissue or fluid isolated from an individual, 
including but not limited to, for example, plasma, serum, 
spinal fluid, lymph fluid, the external sections of the 
skin, respiratory, intestinal, and genitourinary tracts, 
tears, saliva, milk, blood cells, tumors, organs, and 

10 also samples of in vitro cell culture constituents 
(including but not limited to conditioned medium 
resulting from the growth of cells in cell culture 
medium, putatively virally infected cells, recombinant 
cells, and cell components) . 

15 As used herein, the term "oligomer" refers to 

primers and to probes. The term oligomer does not 
connote the size of the molecule. However, typically 
oligomers are no greater than 1000 nucleotides, more 
typically are no greater them 500 nucleotides, even more 

20 typically are no greater than 250 nucleotides; they may 

be no greater than 100 nucleotides, and may be no greater 
than 75 nucleotides, and also may be no greater than 50 
nucleotides in length. 

The term "coupled" as used herein refers to 

25 attachment by covalent bonds or by strong non- covalent 
interactions (e.g., hydrophobic interactions , hydrogen 
bonds, etc.). Covalent bonds may be, for example, ester, 
ether, phosphoester, amide, peptide, imide, carbon- sulfur 
bonds, carbon -phosphorus bonds, and the like. 

30 The term "support" refers to any solid or semi- 

solid surface to which a desired polypeptide or 
polynucleotide may be anchored. Suitable supports 
include glass, plastic, metal, polymer gels, and the 
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like, and may take the form of beads, wells, dipsticks, 
membranes, and the like. 

The term "label" as used herein refers to any 
atom or moiety which can be used to provide a detectable 
5 (preferably quantifiable) signal, and which can be 
attached to a polynucleotide or polypeptide. 

The term "type 4 pilin" as used herein refer to 
pilins that contain a conserved amino terminal 
hydrophobic domain beginning with an amino -terminal 

10 phenylalanine that is methylated upon processing and 

secretion of the pilin. Another characteristic feature 
of type 4 pilins is that in the propilin form they 
contain similar six- or seven-amino acid long leader 
peptides, which are much shorter than typical signal 

15 sequences. Type 4 pilins are expressed by several 
bacterial genuses, including Neisseria, Moraxella, 
Ba.cteroid.es, and Pseudomonas . Species within these 
genuses which express type 4 pilins are, for example, N. 
gonorrhoeae, N. meningitidis, M. bovis, B. nodosus, and 

20 P. aeruginosa. As used herein, the term "type 4 pilin" 
also includes the Tcp pilin of Vibrio, (for example, V. 
cholerae) , that is highly homologous to the type 4 pilins 
of other genuses. Tcp pilin contains the characteristic 
amino -terminal hydrophobic domain as well as having a 

25 modified N-terminal amino acid that in this case may be a 
modified methionine because the Tcp pilin gene encodes a 
methionine residue at the position where all the others 
encode a phenylalanine. Precursor TcpA contains a much 
longer leader sequence than typical type 4 propilins but 

30 retains homology in the region surrounding the processing 
site. 

The term "pile as used herein refers to a gene 
encoding a polypeptide involved in the assembly of type 4 
pilin, which may also be required for attachment of the 
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pilin, and that is comprised of an epitope that is 
immunologically identifiable with an epitope in PilC of 
N. gonorrhae or N. meningitidis. Included within this 
term is any homologous region from Vibrio, tcpC. 
5 As used herein the term "PilC" refers to a 

polypeptide encoded within pilC, and includes TcpC of 
Vibrio. 

The description of the method to retrieve the 
DNA sequences is mostly of historical interest. The 

10 resultant sequences (and their complements) are provided 
herein, and the sequences, or any portion thereof, could 
be prepared using synthetic methods, or by a combination 
of synthetic methods with retrieval of partial sequences 
using methods similar to those described herein. 

15 The description infra, of "walking" the genome 

by isolating overlapping DNA sequences from the N. 
gonorrheae lambda gt-11 library and from an EMBL3 library 
provides one method by which DNAs corresponding to the 
pile genomes from, inter alia, N. gonorrheae and N. 

20 meningitidis , respectively, may be isolated. However, 

given the information provided herein, other methods for 
isolating pilC DNAs from these species, as well as from 
species of other genuses which have type 4 pilin are 
obvious to one of skill in the art. 

25 Characterization of the genes of the pilC loci 

has provided information on the polypeptides encoded 
therein, and on the control of their expression. 
Even though Type 4 pili have been extensively studied in 
several laboratories, little is known about their 

30 assembly. The presence of a specific assembly machinery 
for this class of pili is evident from the fact that the 
pilin gene of JB. nodosus and M. bovis can be properly 
processed and assembled into a pilus in P. aeruginosa but 
not in E. coli (Ellerman et al., 1986; Mattick et al., 
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1987; Beard et al., 1990). Furthermore, the recent 
genetic characterization of TCP pili of Vibrio cholerae 
has revealed that a number of closely linked genes are 
required for pilin processing and assembly into a 
structure (Taylor et al., 1988). The TCP pilin does not 
carry an N-methylphenylalanine but its primary sequence 
is highly homologous to the Type 4 class of pilins. 

The N. gonorrhoeae pilus facilitates adherence 
of the bacterium to a number of eukaryotic cell types 
(Watt et al., 1980) and is thought to play a role in 
bacterial interaction with neutrophils (Fischer and Rest, 
1988) . The pilin is encoded from one or two pilE loci 
(Meyer et al., 1984; Swanson et al., 1986) which most 
likely each form a monocistronic operon. Hence, there 
15 have been no suggestions that genes closely linked to 
pilE are involved in pilus assembly. A dispersed 
location of genes involved in gonococcal pilus assembly 
as well as the rapid occurrence of nonpiliated variants 
generated via recombination with pilin sequences from 
20 silent loci, pilS, have made it extremely difficult to 
identify putative assembly genes for gonococcal pili. 

The PilC protein described herein is a protein 
encoded within a pilC or equivalent (for example, tcpC) 
locus or gene. In N. gonorrhoeae MS11 and most other 
25 gonococcal strains the PilC protein is expressed in small 
amounts. It is the only protein that is enriched in 
highly purified preparations of MS11 pili. PilC was not 
released from a nonpilated MS11 (P'n) variant using the 
same procedure suggesting that this protein interacts 
30 with the polymerized pilus fiber. 

DMA sequence analysis of the cloned pilCl gene 
revealed one long open reading frame that was out of 
frame with its putative AUG initiation codon and 5' end 
encoding the signal peptide. Minute amounts of PilC were 
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expressed in E. coll from pABJ04. Gel purified PilC from 
MS11 contained a lysine residue in position four, whereas 
pllCl had a glutamine codon at this position. A lysine 
codon was, however, found at position four in a number of 
5 PCR amplified 5' pilC fragments suggesting that these 

fragments represent the 5' end of pllC2, which then must 
be ON in MSll. The finding that a miniTnCm insertion in 
pilC2 abolished PilC expression, whereas insert ional 
inactivation of pllCl did not abolish PilC expression 

10 further argues that pllCl is translationally out of frame 
and pilC2 translationally in frame in the MS11 variant we 
are studying. 

PCR amplified fragments of pilCl and pilC2 in 
MSll differed in the number of G residues found in the G 

15 tract. Only 11 or 12 Gs were found in pilCl clones 

(which would both generate an OFF phenotype) while 12 or 

13 Gs were found among pilC2 specific clones. Since 
pilC2 is the expressed gene in the MSll variant under 
study, we believe that this variant carries 13 Gs in 

20 p±lC2 and 12 Gs in pllCl. The frequency of frameshift 

mutations in each locus is not known. However, the lack 
of 13 Gs among pilCl specific fragments and the lack of 

14 Gs among p±lC2 specific fragments suggests that a 
deletion of one G residue occurs at a higher frequency 

25 than the insertion of one G residue. We had expected to 
find amplified fragments from N. gonorrhoeae containing 
10 G residues in the G tract, but found none in the 48 
clones sequenced. If only one G is added or deleted in 
each mutational event, the frequency of G tracts with 10 

30 residues should be low if G tracts normally are 12 or 13 
bp long. 

Frameshif ting in pilCl also occurred in E. 
coll. In this case, however, two variants with 10 
residues were found out of 12 clones sequenced. It may 
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therefore be that there is a selection against in frame 
variants with 10 Gs in N. gonorrhoeae. A change from 
five glycines to four in the signal peptide may for 
example have an effect on the physical properties of the 
5 precursor form of PilC such that the signal peptide is 
not cleaved off. E. coll strain AA10 is recA. 
Therefore, frameshift mutations in the G tract of pllC 
occurs independent of the RecA protein. 

Translational f rameshif ting has been shown to 

10 regulate phase and antigenic variation of the gonococcal 
opacity protein PII that is encoded by a number of opa 
loci showing sequence variations. In this system a 
number of pentameric CTCTT repeats are present in the 
region encoding the signal peptide (Stern et al. 1986) . 

15 Variation in the number of repeats is independent of recA 
in N. gonorrhoeae as well as in E. coll (Murphy et al . , 
1989) . Variation in the expression of lipopolysaccharide 
epitopes in Haemophilus Influenzae was recently explained 
by translational f rameshif ting created by alterations in 

20 the number of CAAT repeats occurring in the 5' end of 
llch (Weiser et al., 1989). In Bordetella pertussis 
frameshift mutations in the regulatory vlr locus occur in 
a run of C residues positioned internally in the gene 
(Stibitz et al., 1989). The C tract was in this case 

25 varying from 6 (in frame) to 7 residues (out of frame) . 
It is not known if this frameshift mutation is programed 
or not. The pilin gene of Bordetella pertussis was 
recently shown to be preceded by a stretch of Cs. 
Frequent mutations affecting the length of this C tract 

30 influenced the transcriptional activity of the pilin gene 
(Willems et al., 1990). 

Variation in the number of the CTCTT repeats in 
opa genes was recently suggested to be due to 
recombination- independent slipped strand mispairing 



BN8DOCID: «WO_^9l21 3871 A1J_j> 



WO 92/13871 




PCT/US92/00863 



-27- 

(Murphy et al M 1989). Mispairing is thought to occur 
between strands subjected to local denaturation and 
should preferentially occur during replication. A number 
of unusual DNA structures (cruciform, Z form, H form) 
5 have been shown to form in vitro within a variety of 
specific DNA sequences. Under normal conditions the B 
form is the most favorable thermodynamically ( Frank - 
Kamenetskii and Voiogodskii, 1984) . Transition to 
alternative conformations requires specific external 

10 conditions, supercoiling being the most physiologic. 
Single stranded (dG) n and (dC) n strands renature more 
slowly than complementary strands with arbitrary 
sequences, and methylation experiments suggest that a 
poly dG chain may form a hairpin- like structure 

15 stabilized by G-G bp (Panyutin et al., 1990). 

Four variant sequences differing outside the G 
tract were obtained by PCR amplif ication of the 5' end of 
pile from four W. gonorrhoeae strains. The region 5' of 
the G tract was invariant, as was the 3' end of the 

20 amplified region. All variation was confined to a region 
located 3' of the G tract. At least some of these 
sequence variations can be explained by mismatch pairing 
events. Thus, the addition of four nucleotides distal to 
the G tract in variant sequence 4 is possible to explain 

25 by a two step mispairing event occurring within variant 
sequence 2. Slip strand mispairing between the two CA 
residues in - GGCGCAGGCGCA- would yield 
-GGCGCAGGCGCAGGCGCA- . A second mispairing event 
occurring between the two C- residues at positions 3 and 5 

30 gives rise to the sequence -GGCAGGCGCAGGCGCA- present in 
variant 4. It may therefore be that a sequence close to 
a poly(G) tract is prone to slipped strand mispairing. 

Gonococcal pilus phase variation is associated 
with an altered nucleotide sequence of pilE via 
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recombinations with silent pllS sequences (Haas and 
Meyer, 1986, Swanson et al., 1986). An irreversible 
switch OFF in pilus expression results from deletions of 
the 5' coding and control regions of the pilE locus 
5 (Swanson et al., 1985). Reversible gonococcal pilus 

phase variation is associated with nucleotide changes in 
pllE resulting in an altered pilin product. It has been 
suggested that the pilins of these variants are assembly 
defective (Bergstrom et al., 1986; Swanson et al., 1986; 

10 Hill et al., 1990). Here we present evidence that switch 
OFF and ON of PilC expression causes pilus expression to 
phase vary. Five out of five P~, pilin producing 
descendants from MSll^ (P + , PilC + ) that expressed pilin 
did not express PilC. All tested P + revertants from the 

15 five P~, PilC" variants had regained expression of PilC. 
The pilin of one nonpiliated PilC OFF-switcher (variant 
8) differed by eight amino acids from that of the parent. 
The fact that one piliated PilC + backswitcher (8:1) 
expressed a pilin identical in sequence to the 

20 nonpiliated variant (8) strongly suggests that the 

regained expression of pili is due to an ON- switch in 
PilC expression. The above results also imply that the 
nonpiliated phenotype of variant 8 is not due to the 
alterations in the pilin relative to the parental strain 

25 but to an OFF- switch of PilC. The finding that mTnCm 
insertions resulted in P + colonies when inserted into 
pilCl and P" colonies when inserted into the actively 
expressing p±lC2 locus offers further evidence that PilC 
is essential for the biogenesis of gonococcal pili. P" , 

30 p±lC2z :mTnCm-12 insertion mutants reverted to P + colony 
morphology at a low frequency. These revertants most 
likely represent frameshif ting mutants in pllCl resulting 
in expression of PilC from this locus. A double mutant 
in pllCl and pilC2 was stably nonpiliated, expressed 
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pilin, but expressed amounts of pilin that did not 
express any pili when examined by transmission electron 
microscopy. It is therefore believed that out of frame 
mutations of both pilCl and p±lC2 will abolish pili 
5 formation. 

At this stage we cannot exclude the possibility 
that some PilC" variants f rom MSII^ (P + , PilC" ) are 
generated by transformation of pllCl sequences and 
homologous recombination with p±lC2 thus generating 

10 variants with two pllCl 5 ' ends at both pilC loci. PilC + 
revertants from PilC" clones must, however, all be due to 
frameshift mutations in either pilCl or pilC2. 

We propose that PilC forms an outer membrane 
pore or assembly center enabling the pilin subunits to be 

15 assembled and translocated across the outer membrane 

analogous to the proposed function of the high molecular 
weight proteins required for the assembly of 
enterobacterial pili (the latter of which is discussed in 
Norgren et al., 1987). Alternatively, PilC may act as an 

20 initiator for polymerization. In the latter case PilC 
would be expected to be located at the tip of the 
polymerized pilus. 

It is possible that the alternate expression of 
PilC from two structurally different pilC loci is yet 

25 another example of antigenic variation in Neisseria 
gonorrhoeae. It is, however, possible that this 
variation could have functional implications as well. 
Each class of E. coll pili utilizes a different outer 
membrane pore/assembly protein. Hence, pilin subunits 

30 and/ or periplasmic chaperone complexes may specifically 
interact with an exposed region of the protein allowing 
polymerization of pilus subunit proteins. The repertoire 
of antigenic variants of gonococcal pilins is vast 
(Hagblom et al., 1985) . It may be that only certain 
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pilin variants are assembled via PilCl and PilC2 
respectively. Alternatively, if PilC acts as an 
initiator it could also possess other properties such as 
being involved in Pilus mediated attachment. 
5 In one embodiment of the invention, 

immunogenically active polypeptides encoded within pile 
are prepared. The availability of pile DNA sequences, 
either those isolated by utilizing the DNA sequences 
described in the Examples, or nucleotide sequences 
10 derived therefrom (including segments and modifications 
of the sequence) , permits the construction of expression 
vectors encoding immunologically reactive regions of the 
polypeptide encoded in either strand. Immunological 
reactivity may be determined by immunoassay using 
15 antibodies raised to PilC. Fragments encoding the 

desired polypeptides are derived from the DNA clones 
using conventional restriction digestion or by synthetic 
methods, and are ligated into vectors which may, for 
example, contain portions of fusion sequences such as 
20 beta-galactosidase or superoxide dismutase (SOD) , prefer- 
ably SOD. Methods and vectors which are useful for the 
production of polypeptides which contain fusion sequences 
of SOD are described in European Patent Office 
Publication number 0196056, published October 1, 1986. 
25 Any desired portion of the pile DNA containing an open 

reading frame, in either sense strand, can be obtained as 
a recombinant polypeptide, such as a mature or fusion 
protein; alternatively, a polypeptide encoded in the DNA 
can be provided by chemical synthesis. 
30 The DNA encoding the desired polypeptide, 

whether in fused or mature form, and whether or not 
containing a signal sequence to permit secretion, may be 
ligated into expression vectors suitable for any 
convenient host. Both eukaryotic and prokaryotic host 
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systems are presently used in forming recombinant 
polypeptides, and a summary of some of the more common 
control systems and host cell lines is given infra. The 
polypeptide is then isolated from lysed cells or from the 
5 culture medium and purified to the extent needed for its 
intended use. Purification may be by techniques known in 
the art, for example, differential extraction, salt 
fractionation, chromatography on ion exchange resins, 
affinity chromatography, centrifugation, and the like. 

10 See, for example, Methods in Enzymology for a variety of 
methods for purifying proteins. Such polypeptides can be 
used as diagnostics, or those which give rise to 
neutralizing antibodies may be formulated into vaccines. 
Antibodies raised against these polypeptides can also be 

15 used as diagnostics, or for passive immunotherapy. 

The Pile antigens may also be isolated from 
meningococci and from gonococci. The bacteria may be 
grown by conditions known in the art, some of which are 
described infra. In addition, a method for isolating 

20 PilC from gonococci is described infra. 

In another embodiment of the invention, the 
immunoreactive polypeptides may be conjugated with 
carrier. An antigenic region of a polypeptide is gener- 
ally relatively small- -typically 8 to 10 amino acids or 

25 less in length. Fragments of as few as 5 amino acids may 
characterize an antigenic region. These segments may 
correspond to regions of PilC antigen. Accordingly, 
using the DNAs of pile as a basis, DNAs encoding short 
segments of PilC polypeptides can be expressed 

30 recombinantly either as fusion proteins, or as isolated 
polypeptides. In addition, short amino acid sequences 
can be conveniently obtained by chemical synthesis. In 
instances wherein the synthesized polypeptide is . 
correctly configured so as to provide the correct 

35 
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epitope, but is too small to be immunogenic, the 
polypeptide may be linked to a suitable carrier. 

A number of techniques for obtaining such link- 
age are known in the art, including the formation of 
5 disulfide linkages using 

N-succinimidyl-3- (2 -pyridyl-thio) propionate (SPDP) and 
succinimidyl 

4- (N-maleimido -methyl) cyclohexane- 1-carboxylate (SMCC) 
obtained from Pierce Company, Rockford, Illinois, (if the 

10 peptide lacks a sulfhydryl group, this can be provided by 
addition of a cysteine residue.) These reagents create a 
disulfide linkage between themselves and peptide cysteine 
residues on one protein and an amide linkage through the 
epsilon-amino on a lysine, or other free amino group in 

15 the other. A variety of such disulf ide/amide- forming 
agents are known. See, for example, Immun. Rev. (1982) 
£2:185. Other bifunctional coupling agents form a 
thioether rather than a disulfide linkage. Many of these 
thio- ether- forming agents are commercially available and 

20 include reactive esters of 6-maleimidocaproic acid, 
2-bromoacetic acid, 2-iodoacetic acid, 

4- (N-maleimido -methyl) cyclohexane- 1-carboxylic acid, and 

the like. The carboxyl groups can be activated by 

combining them with succinimide or 
25 i-hydroxyl-2-nitro-4-sulfonic acid, sodium salt. 

Additional methods of coupling antigens employs the 

rotavirus/ "binding peptide" system described in EPO Pub. 

No. 259,149, the disclosure of which is incorporated 

herein by reference. The foregoing list is not meant to 
30 be exhaustive, and modifications of the named compounds 

can clearly be used. 

Any carrier may be used which does not itself 

induce the production of antibodies harmful to the host. 

Suitable carriers are typically large, slowly metabolized 

35 
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macromolecules such as proteins; polysaccharides, such as 
latex functionalized sepharose, agarose, cellulose, cel- 
lulose beads and the like; polymeric amino acids, such as 
polyglutamic acid, polylysine, and the like; amino acid 
5 copolymers; and inactive virus particles or attenuated 
bacteria of other strains, for example, those of 
Salmonella. Especially useful protein substrates are 
serum albumins, keyhole limpet hemocyanin, immunoglobulin 
molecules, thyroglobulin, ovalbumin, tetanus toxoid, and 

10 other proteins well known to those skilled in the art. 

In addition to full-length PilC proteins, 
polypeptides comprising truncated PilC amino acid 
sequences encoding at least one immunologically reactive 
epitope are useful immunological reagents- For example, 

15 polypeptides comprising such truncated sequences can be 
used as reagents in an immunoassay . These polypeptides 
also are candidate subunit antigens in compositions for 
antiserum production or vaccines. While these truncated 
sequences can be produced by various known treatments of 

20 native bacterial protein, it is generally preferred to 
make synthetic or recombinant polypeptides comprising a 
PilC sequence. Polypeptides comprising these truncated 
PilC sequences can be made up entirely of PilC sequences 
(one or more epitopes, either contiguous or 

25 noncontiguous) , or PilC sequences and heterologous 
sequences in a fusion protein. Useful heterologous 
sequences include sequences that provide for secretion 
from a recombinant host, enhance the immunological 
reactivity of the PilC epitope (s), or facilitate the 

30 coupling of the polypeptide to an immunoassay support or 
a vaccine carrier. See, e.g., EPO Pub. No. 116,201; 
U.S. Pat. No. 4,722,840; EPO Pub. No. 259,149; U.S. 
Pat. No. 4,629,783, the disclosures of which are 
incorporated herein by reference. 
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The size of polypeptides comprising the 
truncated PilC sequences can vary widely, the minimum 
size being a sequence of sufficient size to provide an 
immunologically reactive PilC epitope, while the maximum 
5 size is not critical . For convenience, the maximum size 
usually is not substantially greater than that required 
to provide the desired PilC epitopes and function (s) of 
the heterologous sequence, if any. Typically, the 
truncated PilC amino acid sequence will range from about 

10 5 to about 100 amino acids in length. More typically, 

however, the PilC sequence will be a maximum of about 50 
amino acids in length, preferably a maximum of about 30 
amino acids. It is usually desirable to select PilC 
sequences of at least about 10, 12 or 15 amino acids, up 

15 to a maximum of about 20 or 25 amino acids. 

Truncated PilC amino acid sequences comprising 
epitopes can be identified in a number of ways. For 
example, the entire PilC protein sequence can be screened 
by preparing a series of short peptides that together 

20 span the entire protein sequence. By starting with, for 
example, lOOmer polypeptides, it would be routine to test 
each polypeptide for the presence of epitope (s) showing a 
desired reactivity, and then testing progressively 
smaller and overlapping fragments from an identified 

25 lOOmer to map the epitope of interest. Screening such 
peptides in an immunoassay is within the skill of the 
art. It is also known to carry out a computer analysis 
of a protein sequence to identify potential epitopes, and 
then prepare oligopeptides comprising the identified 

30 regions for screening. 

In another embodiment of the invention, the 
immunogenicity of the epitopes of PilC may also be 
enhanced by preparing them in mammalian or yeast systems 
fused with or assembled with particle- forming proteins 

35 
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such as, for example, that associated with hepatitis B 
surface antigen. See, e.g., U.S. Pat. No. 4,722,840. 
Constructs wherein the Pile epitope is linked directly to 
the particle- forming protein coding sequences produce 
5 hybrids which are immunogenic with respect to the PilC 
epitope. In addition, all of the vectors prepared 
include epitopes specific to HBV, having various degrees 
of immunogenicity, such as, for example, the pre-S 
peptide. Thus, particles constructed from particle 

10 forming protein which include PilC sequences are 

immunogenic with respect to the microorganism encoding 
the PilC epitope (for example, Neisseria, Vibrio, 
Moraxella, Bacteroides, or Pseudomonas) and HBV. 

Hepatitis surface antigen (HBSAg) has been 

15 shown to be formed and assembled into particles in 5. 

cerevisiae (Valenzuela et al. (1982)), as well as in, for 
example, mammalian cells (Valenzuela, P., et al. (1984)). 
The formation of such particles has been shown to enhance 
the immunogenicity of the monomer subunit. The 

20 constructs may also include the immunodominant epitope of 
HBSAg, comprising the 55 amino acids of the presurface 
(pre-S) region. Neurath et al. (1984) . Constructs of 
the pre-S-HBSAg particle expressible in yeast are 
disclosed in EPO 174,444, published March 19, 1986; 

25 hybrids including heterologous viral sequences for yeast 
expression are disclosed in EPO 175,261, published March 
26, 1966. These constructs may also be expressed in 
mammalian cells such as Chinese hamster ovary (CHO) cells 
using an SV40-dihydrof olate reductase vector (Michelle et 

30 al. (1984)). 

In addition, portions of the particle -forming 
protein coding sequence may be replaced with codons 
encoding a PilC epitope. In this replacement, regions 
which are not required to mediate the aggregation of the 

35 
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units to form immunogenic particles in yeast or mammals 
can be deleted, thus eliminating additional HBV antigenic 
sites from competition with the PilC epitope. 

In another embodiment of the invention, the 
5 immunoreactive polypeptides encoded in pilC are prepared 
into vaccines. Vaccines may be prepared from one or more 
immunogenic polypeptides derived from pilC. If 
recombinant, these polypeptides may be expressed in 
various host cells (e.g., bacteria, yeast, insect, or 

10 mammalian cells) , or alternatively may be isolated from 

the bacterial preparations. In addition to the above, it 
is also possible to prepare live vaccines of attenuated 
microorganisms which express one or more recombinant 
polypeptides derived from the pilC gene. Suitable 

15 attenuated microorganisms are known in the art and 

include, for example, viruses {e.g., vaccinia virus (see 
Brown et al. (1986)), as well as bacteria. 

The preparation of vaccines which contain an 
immunogenic polypeptide (s) as active ingredients, is 

20 known to one skilled in the art. Typically, such 

vaccines are prepared as injectables, either as liquid 
solutions or suspensions; solid forms suitable for solu- 
tion in, or suspension in, liquid prior to injection may 
also be prepared. The preparation may also be 

25 emulsified, or the protein encapsulated in liposomes. 

The active immunogenic ingredients are often mixed with 
excipients which are pharmaceutically acceptable and 
compatible with the active ingredient. Suitable 
excipients are, for example, water, saline, dextrose, 

30 glycerol, ethanol, or the like and combinations thereof. 
In addition, if desired, the vaccine may contain minor 
amounts of auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, and/or adjuvants 
which enhance the effectiveness of the vaccine. Examples 
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of adjuvants which may be effective include but are not 
limited to: aluminum hydroxide, N-acetyl -muramyl - 
L-threonyl-D-isoglutamine (thr-MDP) , N-acetyl -nor- 
muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to 
5 as nor-MDP) , N-acetylmuramyl-L-alanyl-D- isoglutaminyl- 

L-alanine-2- (1' -2' -dipalmitoyl-sn-glycero-3 -hydroxyphosph 
oryloxy) -ethylamine (CGP 19835A, referred to as MTP-PE) # 
and RIBI, which contains three components extracted from 
bacteria, monophosphoryl lipid A, trehalose dimycolate 

10 and cell wall skeleton (MPL+TDM+CWS) in a 2% 

squalene/Tween 80 emulsion. The effectiveness of an 
adjuvant may be determined by measuring the amount of 
antibodies directed against an immunogenic polypeptide 
containing a PilC immunoreactive sequence resulting from 

15 administration of this polypeptide in vaccines which are 
also comprised of the various adjuvants. 

The vaccines are conventionally administered 
parenterally, by injection, for example, either 
subcutaneous ly or intramuscularly. Additional 

20 formulations which are suitable for other modes of 

administration include suppositories and, in some cases, 
oral formulations. For suppositories, traditional 
binders and carriers may include, for example, 
polyalkylene glycols or triglycerides; such suppositories 

25 may be formed from mixtures containing the active 

ingredient in the range of 0.5% to 10%, preferably l%-2%. 
Oral formulations include such normally employed 
excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodium 

30 saccharine, cellulose, magnesium carbonate, and the like. 
These compositions take the form of solutions, suspen- 
sions, tablets, pills, capsules, sustained release 
formulations or powders and contain 10%- 95% of active 
ingredient, preferably 25%- 70%. 
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The proteins may be formulated into the vaccine 
as neutral or salt forms. Pharmaceutically acceptable 
salts include the acid addition salts (formed with free 
amino groups of the peptide) and which are formed with 
5 inorganic acids such as, for example, hydrochloric or 
phosphoric acids, or such organic acids such as acetic, 
oxalic, tartaric, maleic, and the like. Salts formed 
with the free carboxyl groups may also be derived from 
inorganic bases such as, for example, sodium, potassium, 

10 ammonium, calcium, or ferric hydroxides, and such organic 
bases as isopropylairdne, trimethylamine, 2-ethylamino 
ethanol, histidine, procaine, and the like. 

The vaccines are administered in a manner 
compatible with the dosage formulation, and in such 

15 amount as will be prophylactically and/or therapeutically 
effective. The quantity to be administered, which is 
generally in the range of 5 micrograms to 250 micrograms 
of antigen per dose, depends on the subject to be 
treated, capacity of the subject's immune system to 

20 synthesize antibodies, and the degree of protection 

desired. Precise amounts of active ingredient required 
to be administered may depend on the judgment of the 
practitioner and may be peculiar to each subject. 

The vaccine may be given in a single dose 

25 schedule, or preferably in a multiple dose schedule. A 
multiple dose schedule is one in which a primary course 
of vaccination may be with 1-10 separate doses, followed 
by other doses given at subsequent time intervals 
required to maintain and or reenf orce the immune 

30 response, for example, at 1-4 months for a second dose, 

and if needed, a subsequent dose(s) after several months. 
The dosage regimen will also, at least in part, be 
determined by the need of the individual and be dependent 
upon the judgment of the practitioner. 
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In addition, the vaccine containing the im- 
munogenic antigen (s) derived from pilC may be 
administered in conjunction with other immunoregulatory 
agents, for example, immune globulins. 
5 Another embodiment of the invention are 

antibodies which react immunologically with PilC 
epitopes- The immunogenic polypeptides prepared as 
described above are used to produce antibodies, including 
polyclonal and monoclonal. If polyclonal antibodies are 

10 desired, a selected mammal (e.g., mouse, rabbit, goat, 

horse, etc.) is immunized with an immunogenic polypeptide 
bearing a PilC epitope (s) . Serum from the immunized 
animal is collected and treated according to known 
procedures. If serum containing polyclonal antibodies to 

15 a PilC epitope (i.e., an epitope encoded within pilC) 
contains antibodies to other antigens, the polyclonal 
antibodies can be purified by immunoaf f inity 
chromatography. Techniques for producing and processing 
polyclonal antisera are known in the art, see for 

20 example, Mayer and Walker (1987) . 

Monoclonal antibodies directed against PilC 
epitopes can also be readily produced by one skilled in 
the art. The general methodology for making monoclonal 
antibodies by hybridomas is well known. Immortal 

25 antibody -producing cell lines can be created by cell 
fusion, and also by other techniques such as direct 
transformation of B lymphocytes with oncogenic DNA, or 
transfection with Epstein-Barr virus. See, e.g., M. 
Schreier et al. (1980) ; Hammerling et al. (1981) ; Kennett 

30 et al. (1980); see also . U.S. Patent Nos. 4,341,761; 

4,399,121; 4,427,783; 4,444,887; 4,466,917; 4,472,500; 
4,491,632; and 4,493,890. Panels of monoclonal 
antibodies produced against PilC epitopes can be screened 
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for various properties; i.e., for isotype, epitope affin- 
ity, etc. 

Antibodies, both monoclonal and polyclonal, 
which are directed against PilC epitopes are particularly 
5 useful in diagnosis, and those which are neutralizing are 
useful in passive immunotherapy. Methods for introducing 
antibodies into an individual to accomplish passive 
immunotherapy are known in the art. In addition, 
monoclonal antibodies, in particular, may be used to 

10 raise anti-idiotype antibodies. 

Ant i-idio type antibodies are immunoglobulins 
which carry an "internal image" of the antigen of the 
infectious agent against which protection is desired. 
See, for example, Nisonoff, A., et al. (1981) and 

15 Dreesman et al. (1985). Techniques for raising 

anti-idiotype antibodies are known in the art. See, for 
example, Grzych (1985), MacNamara et al. (1984), and 
Uytdehaag et al. (1985) . These anti-idiotype antibodies 
may also be useful for treatment, vaccination and/ or 

20 diagnosis of the relevant microorganism encoding the 
antigen of interest, (for example. Neisseria, 
Pseudomonas, Moraxella, Bacteroides, or Vibrio) as well 
as for an elucidation of the immunogenic regions of PilC. 

Another embodiment of the invention concerns 

25 immunoassays and diagnostic kits. The polypeptides which 
contain epitopes encoded in pilC which are immunoreactive 
with anti-PilC antibodies in biological samples are 
useful in immunoassays to detect presence of anti-PilC 
antibodies, or the presence of the relevant microorganism 

30 or its antigens in biological samples. Design of the 

immunoassays is subject to a great deal of variation, and 
many formats are known in the art. The immunoassay will 
utilize a polypeptide comprised of at least one epitope 
derived from PilC or encoded in pilC* In one embodiment , 
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the immunoassay uses a combination of epitopes including 
the one derived from PilC or encoded in pllC. These 
epitopes may be derived from the same or from different 
polypeptides, and may be in separate recombinant or 
5 natural polypeptides, or together in the same recombinant 
polypeptides. An immunoassay may use, for example, a 
monoclonal antibody directed towards an epitope (s), a 
combination of monoclonal antibodies directed towards 
epitopes of one antigen, monoclonal antibodies directed 

10 towards epitopes of different antigens, polyclonal 
antibodies directed towards the same antigen, or 
polyclonal antibodies directed towards different 
antigens. Protocols may be based, for example, upon 
competition, or direct reaction, or sandwich type assays. 

15 Protocols may also, for example, use solid supports, or 
may be by immunoprecipitation. Most assays involve the 
use of labeled antibody or polypeptide; the labels may 
be, for example, enzymatic, fluorescent, 

chemiluminescent , radioactive, or dye molecules. Assays 
20 which amplify the signals from the probe are also known; 
examples of which are assays which utilize biotin and 
avidin, and enzyme- labeled and mediated immunoassays, 
such as ELISA assays. 

Typically, an immunoassay for an anti-PilC 
25 antibody (s) will involve selecting and preparing the test 
sample suspected of containing the antibodies, such as a 
biological sample, then incubating it with an 
immunoreactive (also called antigenic) polypeptide (s) 
containing at least one epitope encoded in pilC. The 
30 incubation is under conditions that allow 

antigen- antibody complexes to form. Suitable incubation 
conditions are well known in the art. Subsequent to the 
incubation, complexes which are formed which contain the 
immunoreactive polypeptide are detected. The immunoassay 

35 



BNSDOCtD: <WO_|B21 3ST1A1JL> 



WO 92/13871 w W PCT/US92/00863 



-42- 

may be, without limitations, in a heterogenous or in a 
homogeneous format, and of a standard or competitive 
type. 

In a heterogeneous format, the polypeptide is 
5 typically bound to a solid support to facilitate separa- 
tion of the sample from the polypeptide after incubation. 
Examples of solid supports that can be used are nitro- 
cellulose (e.g., in membrane or microtiter well form), 
polyvinyl chloride (e.g., in sheets or microtiter wells), 
10 polystyrene latex (e.g., in beads or microtiter plates, 
polyvinylidine fluoride (known as Immulon) , diazotized 

paper, nylon membranes, activated beads, and Protein A 

1 2 
beads. For example, Dynatech Immulon or Immulon 

microtiter plates or 0.25 inch polystyrene beads (Preci- 
15 sion Plastic Ball) can be used in the heterogeneous 
format. The solid support containing the antigenic 
polypeptide is typically washed after separating it from 
the test sample, and prior to detection of bound antibod- 
ies. Both standard and competitive formats are known in 
20 the art. 

In a homogeneous format, the test sample is 
incubated with antigen in solution. For example, it may 
be under conditions that will precipitate any 
antigen- antibody complexes which are formed. Both 

25 standard and competitive formats for these assays are 
known in the art. 

In a standard format, the amount of anti-PilC 
antibodies forming the antibody- antigen complex is 
directly monitored. This may be accomplished by 

30 determining whether labeled ant i -xenogeneic (e.g., 

anti -human) antibodies which recognize an epitope on 
anti-PilC antibodies will bind due to complex formation. 
In a competitive format, the amount of anti-PilC 
antibodies in the sample is deduced by monitoring the 

35 
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competitive effect on the binding of a known amount of 
labeled antibody (or other competing ligand) in the 
complex. 

Complexes formed comprising anti-PilC antibody 
5 (or, in the case of competitive assays, the amount of 
competing antibody) are detected by any of a number of 
known techniques, depending on the format. For example, 
unlabeled anti-PilC antibodies in the complex may be 
detected using a conjugate of antixenogeneic Ig complexed 

10 with a label, (e.g., an enzyme label). 

In immunoassays where PilC polypeptides are the 
analyte, the test sample, which may be a biological 
sample, is incubated with anti-PilC antibodies under 
conditions that allow the formation of antigen-antibody 

15 complexes. Various formats can be employed. For 
example, a "sandwich assay" may be employed, where 
antibody bound to a solid support is incubated with the 
test sample; washed; incubated with a second, labeled 
antibody to the analyte, and the support is washed again. 

20 Analyte is detected by determining if the second antibody 
is bound to the support. In a competitive format, which 
can be either heterogeneous or homogeneous, a test sample 
is usually incubated with antibody and a labeled, 
competing antigen is also incubated, either sequentially 

25 or simultaneously. These and other formats are well 
known in the art. 

The antigenic regions of the polypeptides 
encoded in pile can be mapped and identified by screening 
the antigenicity of expression products of pilC DNAs 

30 which encode portions of the PilC. The expression 

products may be from a variety of expression systems, 
including, for example bacterial systems, yeast systems, 
insect systems, and eukaryotic cell systems. In 
addition, studies giving rise to an antigenicity index 
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and hydrophobicity/hydrophilicity profile give rise to 
information concerning the probability of a region's 
antigenicity . 

Efficient detection systems for infection with 
5 pathogenic microorganisms, (for example, Neisseria, 
Pseudomonas, Bacteroides , Moraxella, or Vibrio) may 
include the use of panels of epitopes. The epitopes in 
the panel may be constructed into one or multiple 
polypeptides. At least one of the epitopes will be 
10 encoded in pile or derived from PilC. The assays for the 
varying epitopes may be sequential or simultaneous. 

Kits suitable for immunodiagnosis and contain- 
ing the appropriate labeled reagents are constructed by 
packaging the appropriate materials, including the 
15 polypeptides of the invention containing PilC epitopes 
(i.e., epitopes encoded within pilC) or antibodies 
directed against PilC epitopes in suitable containers. 
The kit may also contain other reagents, for example, 
buffer and standard, as well as other materials required 
20 for the conduct of the assay, as well as a suitable set 
of instructions for conducting the assay using the kit 
materials . 

Another embodiment of the invention are 
oligomers. Using the disclosed portions of the pile DNAs 

25 as a basis, oligomers of approximately 8 nucleotides or 

more can be prepared, either by excision from recombinant 
polynucleotides or by synthetic methods which are known 
in the art. These oligomers can serve as probes for the 
detection (including isolation and/or labeling) of 

30 polynucleotides which contain pilC sequences, and/or as 
primers for the transcription and/or replication of 
targeted pilC sequences. The oligomers contain a 
targeting polynucleotide sequence, which is comprised of 
nucleotides which are complementary to a target pilC 



BNGDOCtD: «WO_j9213871A1JL> 



WO 92/13871 




PCT/US92/00863 



-45- 

nucleotide sequence; the sequence is of sufficient length 
and complementarity with the pile sequence to form a 
duplex which has sufficient stability for the purpose 
intended. For example, if the purpose is the isolation, 
5 via immobilization, of an analyte containing a target 
pile sequence, the oligomers would contain a 
polynucleotide region which is of sufficient length and 
complementarity to the targeted pile sequence to afford 
sufficient duplex stability to immobilize the analyte on 

10 a solid surface, via its binding to the oligomers, under 
the isolation conditions. For example, also, if the 
oligomers are to serve as primers for the transcription 
and/or replication of target pile sequences in an analyte 
polynucleotide, the oligomers would contain a 

15 polynucleotide region of sufficient length and 

complementarity to a region flanking the targeted pile 
sequence to allow the polymerizing agent to continue 
replication from the primers which are in stable duplex 
form with the target sequence, under the polymerizing 

20 conditions. The oligomers may contain a minimum of about 
4 contiguous nucleotides which are complementary to a 
targeted pile sequence; usually the oligomers will 
contain a minimum of about 8 contiguous nucleotides which 
are complementary to the targeted pile sequence, and 

25 preferably will contain a minimum of about 14 contiguous 
nucleotides which are complementary to the targeted pile 
sequence. 

The oligomer, however, need not consist only of 
the sequence which is complementary to the targeted pllC 
30 sequence. It may contain in addition, nucleotide 

sequences or other moieties which are suitable for the 
purposes for which the oligomers are used. For example, 
if the oligomers are used as primers for the 
amplification of targeted pile sequences via the 
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polymerase chain reaction (PCR) , they may contain 
sequences which, when in duplex, form restriction enzyme 
sites which facilitate the cloning of the amplified 
sequences. Other types of moieties or sequences which 
5 are useful of which the oligomers may be comprised or 
coupled to, are those which are known in the art to be 
suitable for a variety of purposes, including the 
labeling of nucleotide probes. 

In the basic nucleic acid hybridization assay, 

10 single- stranded analyte nucleic acid (either DNA or RNA) 
is hybridized to a nucleic acid probe, and resulting 
duplexes are detected. The probes for pllC sequences 
(natural or derived) are a length which allows the 
detection of these sequences by hybridization. While 

15 6-8 nucleotides may be a workable length, sequences of 

10-12 nucleotides are preferred, and about 20 nucleotides 
or more appears optimal. Preferably, these sequences 
will derive from regions which lack heterogeneity. These 
probes can be prepared using routine methods, including 

20 automated oligonucleotide synthetic methods. For use as 
probes, complete complementarity is desirable, although 
it may be unnecessary as the length of the fragment is 
increased. 

For use of such probes as agents to detect the 
25 presence of pile sequences, the sample to be analyzed 

(which may be biological) may be treated, if desired, to 
extract the nucleic acids contained therein. The 
resulting nucleic acid from the sample may be subjected 
to gel electrophoresis or other size separation 
30 techniques; alternatively, the nucleic acid sample may be 
dot blotted without size separation. In order to form 
hybrid duplexes with the targeting sequence of the probe, 
the targeted region of the analyte nucleic acid must be 
in single -stranded form. The latter may occur naturally; 
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alternatively, it may be accomplished by denaturation. 
Denaturation can be accomplished by various techniques 
known in the art. Subsequent to denaturation, the 
analyte nucleic acid and probe are incubated under 
5 conditions which promote stable hybrid formation of the 
target sequence in the probe with the putative targeted 
sequence in the analyte, and the resulting duplexes 
containing probe (s) are detected. 

Detection of the resulting duplex, if any, is 

10 usually accomplished by the use of labeled probes; 

alternatively, the probe may be unlabeled, but may be 
detectable by specific binding with a ligand which is 
labeled, either directly or indirectly. Suitable labels, 
and methods for labeling probes and ligands are known in 

15 the art, and include, for example, radioactive labels 
which may be incorporated by known methods (e.g., nick 
translation or kinasing) , biotin, fluorescent groups, 
chemiluminescent groups (e.g., doxetanes, particularly 
triggered dioxetands) , enzymes, antibodies, and the like. 

20 Variations of this basic scheme are known in the art. 

If the targeted pile sequences are expected to 
be present at relatively low levels, amplification may be 
required for their detection. Such techniques are known 
in the art. For example, the Enzo Biochemical 

25 Corporation "Bio -Bridge" system uses terminal 
deoxynucleotide transferase to add unmodified 
3 ' -poly- dT- tails to a DNA probe. The poly dT- tailed 
probe is hybridized to the target nucleotide sequence, 
and then to a biotin-modif ied poly-A. PCT application 

30 84/03520 and EPA124221 describe a DNA hybridization 
assay in which: (1) analyte is annealed to a 
single- stranded DNA probe that is complementary to an 
enzyme- labeled oligonucleotide; and (2) the resulting 
tailed duplex is hybridized to an enzyme- labeled 
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oligonucleotide. EPA 204510 describes a DNA 
hybridization assay in which analyte DNA is contacted 
with a probe that has a tail, such as a poly-dT tail, an 
amplifier strand that has a sequence that hybridizes to 
5 the tail of the probe, such as a poly- A sequence, and 
which is capable of binding a plurality of labeled 
strands* A particularly desirable technique may first 
involve amplification of the target pile sequences. The 
target pile sequences in sera may be amplified, for 
10 example, to approximately 10 6 sequences/ml. This may be 
accomplished, for example, by the polymerase chain 
reactions (PCR) technique described by Saiki et al. 
(1986), by Mullis, U.S. Patent No. 4,683,195, and by 
Mullis et al. U.S. Patent No. 4,683,202. Amplification 
15 may be prior to, or preferably subsequent to purification 
of the pllC target sequence. For example, amplification 
may be utilized in conjunction with the assay methods 
described in U.S. Patent No. 4,868,105, or if even 
further amplification is desired, in conjunction with the 
20 hybridization system in EPO Publication No. 317,077. 

Generally, in the PCR technique, short 
oligonucleotide primers are prepared which match opposite 
ends of a desired sequence. The sequence between the 
primers need not be known. A sample of polynucleotide is 
25 extracted and denatured, preferably by heat, and 

hybridized with oligomers which are oligonucleotide 
primers, which are present in molar excess. 
Polymerization is catalyzed by a template- and primer- 
dependent polymerase in the presence of deoxynucleotide 
30 triphosphates (dNTPs) , and may also be in the presence of 
nucleotide analogs. This results in two "long products" 
which contain the respective primers at their 5' -termini, 
covalently linked to the newly synthesized complements of 
the original strands. The replicated DNA is again 
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denatured, hybridized with oligonucleotide primers, 
returned to polymerizing conditions, and a second cycle 
of replication is initiated. The second cycle provides 
the two original strands, the two long products from 
5 cycle 1, and two "short products" replicated from the 
long products. The short products contain sequences 
(sense or antisense) derived from the target sequence, 
flanked at the 5 # - and 3 ' -termini with primer sequences. 
On each additional cycle, the number of short products is 

10 replicated exponentially . Thus, this process causes 
amplification of a specific target sequence. 

It will be understood that "primer", as used 
herein, may refer to more than one primer, particularly 
in the case where there is some ambiguity in the 

15 information regarding the terminal sequence (s) of the 
target region to be amplified. Hence, a "primer" 
includes a collection of primer oligonucleotides 
containing sequences representing the possible variations 
in the sequence or includes nucleotides which allow a 

20 typical base pairing. One of the primer oligomers in 
this collection will be homologous with the end of the 
target sequence. 

The amplified sequence (s) may then be detected 
using a hybridization assay which utilize nucleic acid 

25 mul timers which bind to single -stranded analyte nucleic 
acid, and which also bind to a multiplicity of 
single- stranded labeled oligonucleotides. A suitable 
solution phase sandwich assay which may be used with 
labeled polynucleotide probes, and the methods for the 

30 preparation of probes is described in EPO 225,807, 
published June 16, 1987. 

The probes can be packaged into diagnostic 
kits. Diagnostic kits include the probe DNA, which may 
be labeled; alternatively, the probe DNA may be unlabeled 
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and the ingredients for labeling may be included in the 
kit in separate containers. The kit may also contain 
other suitably packaged reagents and materials needed for 
the particular hybridization protocol, for example, 
5 standards, buffers, as well as instructions for 
conducting the test using the kit ingredients. 

The pllC DNA sequence information in the clones 
described in the Examples may be used to gain further 
information on the remaining sequence of the pllC gene 
10 from meningococci, for other possible alleles of pllC in 
Neisseria, as well as pllC in other relevant genuses and 
species. This information will aid in the 
characterization of the gene, and of its role in 
virulence of the pathogenic forms of microorganisms, 
15 including, for example, Neisseria, Pseudomonas, 

Bacteroldes , Moraxella, and Vibrio. Moreover, this 
sequence information can lead to additional 
polynucleotide probes, polypeptides derived from pllC, 
multiple pllC loci, and antibodies directed against PilC 
20 epitopes which would be useful for the diagnosis and/or 

treatment of infections caused by the relevant pathogenic 
microorganisms - 

The DNA sequence information in the 
above-mentioned clones is useful for the design of probes 
25 for the isolation of additional DNA sequences which are 
derived from as yet undefined regions of pllC* For 
example, labeled probes containing a sequence of 
approximately 8 or more nucleotides, and preferably 20 or 
more nucleotides, which are derived from regions close to 
30 the ends of the DNA sequences shown in the Examples. 
These probes may be used to isolate overlapping DNA 
sequences within or adjacent to pllC from DNA libraries 
created from genomes of species having type 4 pilins. 
The resulting overlapping DNAs may then be used to 
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synthesize probes for identification of other overlapping 
fragments which do not necessarily overlap the DNAs whose 
sequences are given in the Examples. Thus, it is 
possible to sequence entire pilC genes utilizing the DNA 
5 sequences provided herein and the technique of isolation 
of overlapping DNAs derived from the pilC genes. 

Methods for constructing DNA libraries are 
known in the art, and are discussed infra; for example, a 
method for the construction of pilC libraries in 

10 lambda-gtll is discussed infra in Section IV. A. However, 
DNA libraries which are useful for screening with nucleic 
acid probes may also be constructed in other vectors 
known in the art, for example, lambda-gtlO (Huynh et al. 
(1985) ) . Another suitable vector for the creation of 

15 libraries may be EMBL3, which is a replacement vector 
which accepts inserts ranging from 9 to 23 kb in size. 
In general, methods for constructing DNA libraries is 
discussed in Maniatis et al, MOLECULAR CLONING, 2nd 
edition, (1989) . 

20 The sequence information derived from these 

overlapping pilC DNAs is useful for determining areas of 
homology and heterogeneity within the pilC genets), which 
could indicate the presence of different strains 
gonococci, meningococci, or other hitherto unrecognized 

25 pathogenic forms of Neisseria. It is also useful for the 
design of hybridization probes to detect PilC antigens or 
pilC nucleic acids in biological samples. Moreover, the 
overlapping DNAs may be used to create expression vectors 
for polypeptides derived from pilC gene(s) . 

30 The pile DNA sequence information may also 

allow the construction of additional bacteriostatic 
agents for treatment of neisserial infections, in that 
they may block the expression of PilC and/or pilin 
assembly. For example, it may be used to derive 
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antisense polynucleotides. Antisense polynucleotides 
molecules are comprised of a complementary nucleotide 
sequence which allows them to hybridize specifically to 
designated regions of genomes or RNAs. Antisense 
5 polynucleotides may include, for example, molecules that 
will block protein translation by binding to mRNA, or may 
be molecules which prevent replication of DNA by 
replicase. They may also include molecules which carry 
agents (non-covalently attached or covalently bound) 

10 which cause the mRNA or genomic DNA to be inactive by 
causing, for example, scissions in these molecules . 
Antisense molecules which are to hybridize to pilC 
derived polynucleotides may be designed based upon the 
sequence information of the pilC DNA sequences provided 

15 herein, including those which would be isolated from 

additional DNA libraries. The antibacterial agents based 
upon anti- sense polynucleotides for pilC may be designed 
to bind with high specif icity, to be of increased 
solubility, to be stable, and to have low toxicity. 

20 Hence, they may be delivered in specialized systems, for 
example, liposomes , or by gene therapy. In addition, 
they may include analogs, attached proteins, substituted 
or altered bonding between bases, etc. 

Other types of drugs may be based upon 

25 polynucleotides which "mimic " important control regions 
of the pilC gene, and which may be therapeutic due to 
their interactions with key components of the system 
responsible for expression of the gene. 

In addition to the specific methods described 

30 in the Examples, general methods are known which may be 
used in the practice of the invention. For example, 
general techniques used in extracting the genome from 
bacteria, including Neisseria, preparing and probing a 
DNA library, sequencing clones, constructing expression 
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vectors , transforming cells, performing immunological 
assays such as radioimmunoassays and ELISA assays, for 
rowing cells in culture, and the like are known in the 
rt and laboratory manuals are available describing these 
5 techniques. However, as a general guide, the following 
sets forth some sources currently available for such 
procedures, and for materials useful in carrying them 
out. 

Both prokaryotic and eukaryotic host cells may 

10 be used for expression of desired coding sequences when 
appropriate control sequences which are compatible with 
the designated host are used. Among prokaryotic hosts, 
E. coli is most frequently used. Expression control 
sequences for prokaryotes include promoters, optionally 

15 containing operator portions, and ribosome binding sites. 
Transfer vectors compatible with prokaryotic hosts are 
commonly derived from, for example, pBR322, a plasmid 
containing operons conferring ampicillin and tetracycline 
resistance, and the various pUC vectors, which also 

20 contain sequences conferring antibiotic resistance mark- 
ers. These markers may be used to obtain successful 
transf ortnants by selection. Commonly used prokaryotic 
control sequences include the Beta- lactamase 
(penicillinase) and lactose promoter systems (Chang et 

25 al. (1977)), the tryptophan (trp) promoter system 
(Goeddel et al. (1980)) and the lambda -derived P L 
promoter and N gene ribosome binding site (Shimatake et 
al. (1981)) and the hybrid tac promoter (De Boer et al. 
(1983)) derived from sequences of the trp and lac UV5 

30 promoters. The foregoing systems are particularly 

compatible with E. coli; if desired, other prokaryotic 
hosts such as strains of Bacillus or Pseudomonas may be 
used, with corresponding control sequences. 
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Eukaryotic hosts include yeast and mammalian 
cells in culture systems. Saccharomyces cerevislae and 
Saccharoxn}rces cajrlsbergensis are the most commonly used 
yeast hosts, and are convenient fungal hosts. Yeast 
5 compatible vectors carry markers which permit selection 
of successful transf ormants by conferring prototrophy to 
auxotrophic mutants or resistance to heavy metals on 
wild- type strains. Yeast compatible vectors may employ 
the 2 micron origin of replication (Broach et al. 

10 (1983)), the combination of CEN3 and ARS1 or other means 
for assuring replication, such as sequences which will 
result in incorporation of an appropriate fragment into 
the host cell genome. Control sequences for yeast 
vectors are known in the art and include promoters for 

15 the synthesis of glycolytic enzymes (Hess et al. (1968) ; 
Holland et al. (1978)), including the promoter for 3 
phosphoglycerate kinase (Hitzeman (1980)). Terminators 
may also be included, such as those derived from the 
enolase gene (Holland (1981) ) . Particularly useful 

20 control systems are those which comprise the 

glyceraldehyde-3 phosphate dehydrogenase (GAPDH) promoter 
or alcohol dehydrogenase (ADH) regulatable promoter, 
terminators also derived from GAPDH, and if secretion is 
desired, leader sequence from yeast alpha factor. In 

25 addition, the transcriptional regulatory region and the 
transcriptional initiation region which are operably 
linked may be such that they are not naturally associated 
in the wild- type organism. These systems are described 
in detail in EPO 120,551, published October 3, 1984; EPO 

30 116,201, published August 22, 1984; and EPO 164,556, 
published December 18, 1985. 

Mammalian cell lines available as hosts for 
expression are known in the art and include many 
immortalized cell lines available from the American Type 
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Culture Collection (ATCC) , including HeLa cells, Chinese 
hamster ovary (CHO) cells, baby hamster kidney (BHK) 
cells, and a number of other cell lines. Suitable 
promoters for mammalian cells are also known in the art 
5 and include viral promoters such as that from Simian 

Virus 40 (SV40) (Fiers (1978)), Rous sarcoma virus (RSV) , 
adenovirus (ADV) , and bovine papilloma virus (BPV) * 
Mammalian cells may also require terminator sequences and 
poly A addition sequences; enhancer sequences which 

10 increase expression may also be included, and sequences 
which cause amplification of the gene may also be 
desirable. These sequences are known in the art. 

Vectors suitable for replication in mammalian 
cells are known in the art, and may include viral 

15 replicons, or sequences which insure integration of the 
appropriate sequences encoding PilC epitopes into the 
host genome. 

A vector which is used to express foreign DNA, 
and which may be used in vaccine preparation is Vaccinia 

20 virus. In this case the heterologous DNA is inserted 

into the Vaccinia genome. Techniques for the insertion 
of foreign DNA into the vaccinia virus genome are known 
in the art, and utilize, for example, homologous 
recombination. The insertion of the heterologous DNA is 

25 generally into a gene which is non-essential in nature, 
for example, the thymidine kinase gene (tk) , which also 
provides a selectable marker. Plasmid vectors that 
greatly facilitate the construction of recombinant 
viruses have been described (see, for example, Mackett et 

30 al. (1984), Chakrabarti et al. (1985); Moss (1987)). 
Expression of the polypeptide containing at least one 
immunoreactive PilC epitope then occurs in cells or 
individuals which are immunized with the live recombinant 
vaccinia virus. 



BNSOOCID: ^WO_*21S871A1JU> 



WO 92/13871 



PCT/US92/00863 



-56- 

Other systems for expression of desired 
polypeptides include insect cells and vectors suitable 
for use in these cells. These systems are known in the 
art, and include, for example, insect expression transfer 
5 vectors derived from the baculovirus Autographs. 

calxfornlca nuclear polyhedrosis virus (AcNPV) , which is 
a helper- independent, viral expression vector. 
Expression vectors derived from this system usually use 
the strong viral polyhedrin gene promoter to drive 
10 expression of heterologous genes. Currently the most 
commonly used transfer vector for introducing foreign 
genes into AcNPV is pAc373. Many other vectors, known to 
those of skill in the art, have also been designed for 
improved expression. These include, for example, pVL985 
15 (which alters the polyhedrin start codon from ATG to ATT, 
and which introduces a BamHI cloning site 32 basepairs 
downstream from the ATT; See Luckow and Summers (1989)). 

Methods for the introduction of heterologous 
DNA into the desired site in the baculovirus virus are 
20 known in the art. (See Summer and Smith, Texas 

Agricultural Experiment Station Bulletin No. 1555; Ju et 
al. (1987) ; Smith et aL (1983) ; and Luckow and Summers 
(1989)) . For example, the insertion can be into a gene 
such as the polyhedrin gene, by homologous recombination; 
25 insertion can also be into a restriction enzyme site 
engineered into the desired baculovirus gene. The 
inserted sequences may be those which encode all or 
varying segments of the polyprotein, or other orfs which 
encode viral polypeptides. For example, the insert could 
30 encode the following numbers of amino acid segments from 
the polyprotein: amino acids 1*1078; amino acids 332-662; 
amino acids 406-662; amino acids 156-328, and amino acids 
199-328. 
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The signals for posttranslational 
modifications, such as signal peptide cleavage, 
proteolytic cleavage, and phosphorylation, appear to be 
recognized by insect cells. The signals required for 
5 secretion and nuclear accumulation also appear to be 

conserved between the invertebrate cells and vertebrate 
cells. Examples of the signal sequences from vertebrate 
cells which are effective in invertebrate cells are known 
in the art, for example, the human interleukin 2 signal 

10 (IL2 ) which is a signal for transport out of the cell, 
is recognized and properly removed in insect cells . 

Recombinant polynucleotides are inserted into 
host cells by transformation. Transformation may be by 
any known method for introducing polynucleotides into a 

15 host cell, including, for example packaging the 

polynucleotide in a virus and transducing a host cell 
with the virus, and by direct uptake of the 
polynucleotide- The transformation procedure used 
depends upon the host to be transformed- Bacterial 

20 transformation by direct uptake generally employs 
treatment with calcium or rubidium chloride (Cohen 
(1972); Maniatis (1989)). Yeast transformation by direct 
uptake may be carried out using the method of Hinnen et 
al- (1978) . Mammalian transformations by direct uptake 

25 may be conducted using the calcium phosphate 

precipitation method of Graham and Van der Eb (1978) , or 
the various known modifications thereof. Other methods 
for the introduction of recombinant polynucleotides into 
cells, particularly into mammalian cells, which are known 

30 in the art include dextran-mediated transf ection, calcium 
phosphate mediated transf ection, polybrene mediated 
transf ection, protoplast fusion, electroporation, 
encapsulation of the polynucleotide (s) in liposomes, and 
direct microinjection of the polynucleotides into nuclei. 
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The recombinant polynucleotide may be in the 
form of a vector. Vector construction employs techniques 
which are known in the art. Site -specific DNA cleavage 
is performed by treating with suitable restriction 
5 enzymes under conditions which generally are specified by 
the manufacturer of these commercially available enzymes. 
In general, about 1 microgram of plasmid or DNA sequence 
is cleaved by 1 unit of enzyme in about 20 microliters 
buffer solution by incubation of 1-2 hr at 37° C. After 

10 incubation with the restriction enzyme, protein is 
removed by phenol/chloroform extraction and the DNA 
recovered by precipitation with ethanol. The cleaved 
fragments may be separated using polyacrylamide or 
agarose gel electrophoresis techniques, according to the 

15 general procedures found in Methods in Enzymology (1980) 
652499-560. 

Sticky ended cleavage fragments may be blunt 
ended using E. coll DNA polymerase I (Klenow) in the 
presence of the appropriate deoxynucleotide triphosphates 

20 (dNTPs) present in the mixture. Treatment with SI 

nuclease may also be used, resulting in the hydrolysis of 
any single stranded DNA portions. 

Ligations are carried out using standard buffer 
and temperature conditions using T4 DNA ligase and ATP; 

25 sticky end ligations require less ATP and less ligase 

than blunt end ligations. When vector fragments are used 
as part of a ligation mixture, the vector fragment is 
often treated with bacterial alkaline phosphatase (BAP) 
or calf intestinal alkaline phosphatase to remove the 

30 5 '-phosphate and thus prevent religation of the vector; 
alternatively, restriction enzyme digestion of unwanted 
fragments can be used to prevent ligation. 

Ligation mixtures are transformed into suitable 
cloning hosts, such as E* coll, and successful 
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transf ontiants selected by, for example, antibiotic 
resistance, and screened for the correct construction. 

The desired recombinant DNA sequences may be 
synthesized by synthetic methods. Synthetic 
5 oligonucleotides may be prepared using an automated 
oligonucleotide synthesizer as described by Warner 
(1984) . If desired the synthetic strands may be labeled 
with 32 P by treatment with polynucleotide kinase in the 
presence of P-ATP, using standard conditions for the 
10 reaction. 

DNA sequences, including those isolated from 
DNA libraries, may be modified by known techniques, 
including, for example site directed mutagenesis, as 
described by Zoller (1982) . Briefly, the DNA to be 

15 modified is packaged into phage as a single stranded 

sequence, and converted to a double stranded DNA with DNA 
polymerase using, as a primer, a synthetic 
oligonucleotide complementary to the portion of the DNA 
to be modified, and having the desired modification 

20 included in its own sequence. The resulting double 

stranded DNA is transformed into a phage supporting host 
bacterium. Cultures of the transformed bacteria, which 
contain replications of each strand of the phage, are 
plated in agar to obtain plaques. Theoretically, 50% of 

25 the new plaques contain phage having the mutated 
sequence, and the remaining 50% have the original 
sequence. Replicates of the plaques are hybridized to 
labeled synthetic probe at temperatures and conditions 
which permit hybridization with the correct strand, but 

30 not with the unmodified sequence. The sequences which 
have been identified by hybridization are recovered and 
cloned. 

DNA libraries may be probed using the procedure 
of Grunstein and Hogness (1975) . Briefly, in this 
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procedure, the DNA to be probed is immobilized on nitro- 
cellulose filters, denatured, and prehybridized with a 
buffer containing 0-50% formamide, 0.75 M NaCI, 75 mM Na 
citrate, 0.02% (wt/v) each of bovine serum albumin, poly- 
5 vinyl pyrollidone, and Ficoll, 50 mM Na Phosphate (pH 
6.5), 0.1% SDS, and 100 micrograms/ml carrier denatured 
DNA.. The percentage of formamide in the buffer, as well 
as the time and temperature conditions of the 
prehybridization and subsequent hybridization steps 

10 depends on the stringency required. Oligomeric probes 
which require lower stringency conditions are generally 
used with low percentages of formamide, lower 
temperatures , and longer hybridization times. Probes 
containing more than 30 or 40 nucleotides such as those 

15 derived from cDNA or genomic sequences generally employ 

higher temperatures, e.g., about 40-42°C / and a high 

percentage , e.g., 50%, formamide . Following 

32 

prehybridization, 5'- P-labeled oligonucleotide probe is 
added to the buffer, and the filters are incubated in 

20 this mixture under hybridization conditions. After 
washing, the treated filters are subjected to 
autoradiography to show the location of the hybridized 
probe; DNA in corresponding locations on the original 
agar plates is used as the source of the desired DNA. 

25 For routine vector constructions, ligation 

mixtures are transformed into E. coll strain HB101 or 
other suitable host, and successful transformants 
selected by antibiotic resistance or other markers. 
Plasmids from the transformants are then prepared 

30 according to the method of Clewell et al. (1969), usually 
following chloramphenicol amplification (Clewell (1972)). 
The DNA is isolated and analyzed, usually by restriction 
enzyme analysis and/or sequencing. Sequencing may be by 
the dideoxy method of Sanger et al. (1977) as further 
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described by Messing et al . (1981), or by the method of 
Maxam et al. (19 80) . Problems with band compression, 
which are sometimes observed in GC rich regions, were 
overcome by use of T-deazoguanosine according to Barr et 
5 al, (1986). 

An enzyme -linked immunosorbent assay (ELISA) 
can be used to measure either antigen or antibody 
concentrations. This method depends upon conjugation of 
an enzyme to either an antigen or an antibody, and uses 

10 the bound enzyme activity as a quantitative label. To 
measure antibody, the known antigen is fixed to a solid 
phase (e.g., a microplate or plastic cup), incubated with 
test serum dilutions, washed, incubated with 
anti- immunoglobulin labeled with an enzyme, and washed 

15 again. Enzymes suitable for labeling are known in the 
art, and include, for example, horseradish peroxidase. 
Enzyme activity bound to the solid phase is measured by 
adding the specific substrate, and determining product 
formation or substrate utilization colorimetrically The 

20 enzyme activity bound is a direct function of the amount 
of antibody bound. 

To measure antigen, a known specific antibody 
is fixed to the solid phase, the test material containing 
antigen is added, after an incubation the solid phase is 

25 washed, and a second enzyme -labeled antibody is added. 

After washing, substrate is added, and enzyme activity is 
estimated colorimetrically, and related to antigen 
concentration . 

Examples 

30 Described below are examples of the present 

invention which are provided only for illustrative 
purposes, and not to limit the scope of the present 
invention. In light of the present disclosure, numerous 
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embodiments within the scope of the claims will be appar- 
ent to those of ordinary skill in the art. 

Exa mple i 

5 Isolation of PilC 

An outer membrane preparation from 
N. gonorrhoeae strain M51l ms (P*) contains small amounts 
of a 110 kd protein, PilC, This protein was enriched 
during alternate cycles of crystallization and 

10 solubilization of pili, unlike other outer membrane 

proteins that decreased in abundance by this procedure. 

The materials and methods used for the 
isolation procedure were the following. 
Bacterial strains and growth conditions 

15 N. gonorrhoeae MSll^ (Meyer et al., 1984) and 

P" and P~n variants of MSll^ (Swanson et al., 1986) were 
kindly obtained from Dr. M. So and from Dr. M. Koorney, 
respectively. The gonococcal isolates UM01 and KH4318 
have previously been described (Norlander et al., 1981). 

20 N. gonorrhoeae strains 605344 and 605103 were obtained 
from Dr. D. Danielsson, Orebro, Sweden, and strain 765 
was isolated at the Department of Bacteriology in Umea, 
Sweden. The commensal Neisseria species N. lactamica 
Nctc 10618 and N. sxibflava GN01 were obtained from 

25 Pharmacia, Uppsala, Sweden. These bacteria were grown at 
37°C in a 5% C0 2 atmosphere on Difco GCB agar containing 
Kellogg' s supplement. Piliated (P + ) and nonpiliated (P~) 
variants were distinguished by colony morphology and 
passed as single colonies. E. coli strain Y 1090 

30 (obtained from Promega Biotech) was used for plaque 

screening, DH5 (Hanahan, 1985) for molecular cloning, 
AA10 recA (Stoker et al. 1984) for isolation of minicells 
and TGI (Gill et al., 1986) for propagation of M13 
clones . 
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Preparation of pili and outer membranes 

Pili were prepared essentially as described by 
Brinton et al . (1978). Gonococci (P + Tr) from 80 GGB 
5 plates, grown for 18 h, were harvested in 0.05 M Tris- 

HC1 pH 8 . 0 and 0 . 15 M NaCl , washed twice and resuspended 
in 40 ml 0.15 M ethanolamine pH 10.5. Pili were sheared 
off in a Sorvall Omnimixer, setting 3 for 30 s. The cell 
debris was pelleted at 13,000 g for 30 min at 4°C and the 

10 supernatant was dialyzed against 0.05 M Tris-HCl pH 8.0 
and 0.15 M NaCl. The crystallized pili were pelleted at 
13,000 g for 60 min., resuspended in 0.15 M ethanolamine 
pH 10.5, and centrifuged at 23,000 g for 60 min. The 
supernatant was dialyzed as described above against 0 . 05 

15 M Tris-HCl pH 8.0 and 0.15 M NaCl. Several cycles of 
crystallization and solubilization were performed to 
produce pili preparations with high purity. Outer 
membranes of N. gonorrhoeae were prepared by the sarkosyl 
method described by Norquist et al. (1978) . 

20 

Example 2 

Preparation of Purified Anti-PilC Antibodies 

The 110 kd protein present in purified MSll^ 
pili preparations was eluted from SDS polyacrylamide gels 
25 and rabbit antibodies were generated against the* gel 
purified protein. The antiserum cross reacted 
extensively with the pilin protein in immunoblots and was 
therefore absorbed with extracts of Pseudomonas putida 
express ing the pilin subunit of N. gonorrhoeae on plasmid 
30 pGC02. 

Pili preparations of N. gonorrhoeae MSll^fP 4 ") 
crystallized 5 times were separated on 10% SDS -poly- 
acrylamide gels using the buffer system of Laemmli 
(1970) . These gels were stained in 0.25 M KC1 and 1 raM 
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DTT for 5 mill, , the 110 kd protein band was sliced out, 
crushed and incubated in a buffer containing 0.05 M Tris- 
HC1 pH 7.9, 0.1 mM EDTA, 5 mM DTT and 0.15 H NaCl at 4°C 
overnight. Gel pieces were removed by cent rifugat ion 
5 prior to immunization of rabbits. 

The achieved 110 kd- antiserum was extensively 
absorbed with Pseudomonas putida 2440 (Bagdasarien et al. 
(1983), carrying a recombinant plasmid, pGC02, 
constructed as follows. The 1.0 kb flpal-EcoRI fragment 
10 of the pilus gene clone pNGHOO (Meyer et al. 1984) 

obtained from M. So was cloned into the flpal and EcoRl 
sites of pMMB66 (Furste et al., 1986). The pllE gene is 
then under control of the tac promoter and induction with 
1 ittM IPTO resulted in high levels of pilin produced in P. 
15 putida 2440, but no extracellular pili structures were 
observed. Dense sonicated cultures of P. putida 
2440/pGC02 were mixed in a 1:1 ratio with the crude 
antiserum. About 15 cycles of 1 h incubation and 30 min 
centrifugation at 25,000 g in the presence of 1 mM PMSF 
20 (phenylmethylsulfonylfluoride) at 4°C were performed. 

The pili antiserum used in immunoblots was 
generated in a rabbit against highly purified pili 
preparations of N. gonorrhoeae MS 11^. 

In immunoblots 10 /ig of boiled bacterial cells 
25 or the same amount of outer membranes were 

electrophoresed on 10% SDS-polyacrylamide gels. The 
proteins were transferred from the gel onto 
nitrocellulose sheets where their immunological cross - 
reaction with the 110 kd absorbed antiserum was tested 
30 using an immunoblotting protocol as described by Towbin 
et al. (1979) . 
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Example 3 

Specificity of Purified Anti-PilC Antibodies 

The absorbed antiserum was used in iinmunoblots 
with whole cell extracts of a number of N. gonorrhoeae 
5 strains as well as commensal strains of Neisseria (Figure 
2A) . All strains of N. gonorrhoeae, except strain 
605103, contained one or two high molecular weight 
protein species reacting with the antiserum. Strain 
605103, unlike the other strains tested, was nonpiliated 

10 and no piliated variants could be obtained suggesting 
that it is a P"n variant (Swanson et al., 1985) . This 
was confirmed by Southern blot hybridization using an 
oligonucleotide probe corresponding to the 5' end of the 
pilE gene. No hybridization was obtained with this 

15 probe. The commensal ff. lactaxnica Nctcl0618, but not N. 
subflava GN01, contained a high molecular weight protein 
reacting with the 110 kd antiserum. Iinmunoblots against 
outer membrane preparations of P + and P"n MSU^ showed 
the 110 kd protein to be present in the outer membrane in 

20 both of these MS11 variants. 

Southern blot hybridization was accomplished as 
follows. Digested genomic DNA was separated on 0.7% 
agarose gels and transferred to nitrocellulose filters 
(Southern, 1975) . After transfer and baking the filters 

25 were prehybridized in a mixture of 5 x SSC, 0.1% SDS, 
5 itiM EDTA, 5 x Denhardt's solution and 100 /ig/ml of 
sonicated calf thymus DNA at 65°C for 2-6 h. 32 P-labeled 
probe (multiprime DNA labelling system, Amersham 
International) was added and hybridization was performed 

30 for 12-15 h at the same temperature. The filters were 
washed in 2 x SSC with 0.1% SDS and in 0.2 x SSC with 
0.1% SDS for 2 x 15 min each, dried and exposed to Kodak 
XRP film at -80°C. 



BN80OCID: ««WO__021 88TIA1JL> 



WO 92/13871 




PCT/US92/00863 



-66- 

A 21-base-long oligonucleotide complementary to 
the signal peptide coding region of pllE (5'- 
GCCTTTTTGAAGGGTATTCAT-3' ) was 32 P-labeled with T4 
polynucleotide kinase and used to probe Clal- digested 
5 genomic DNA. The blot was prehybridized at 37°C in a 

mixture containing 2 x Denhardt's, 0,1% SDS, 2.5 mM EDTA, 
5 x SSC and 100 fig /ml sonicated calf thymus DNA, 
hybridized at 37°C and washed in 2 x SSC for 5 min. 
MSllj^CP*) gave a 4 kb hybridization fragment, whereas 
10 MSllj^tP'n) and 605103 gave no hybridization signal. 

Example 4 
Molecular cloning of the pllCl gene 
encoding a 110 kd protein 

15 Chromosomal DNA from N. gonorrhoeae MSll^fP*) 

was used to construct a Xgtll library. The library was 
screened with the absorbed 110 kd antiserum and one 
positive clone out of 10,000 plaques was found, 
containing an 800 bp insert. A lysogen of this positive 

20 Xgtll clone was examined in immunoblots and a fusion 

protein with an estimated size of 150 kd reacted with the 
antiserum (data not shown) . The 800 bp insert was 
purified, labeled with 32 P, and used as a probe to screen 
a plasmid library from N. gonorrhoeae MSll ms . Six clones 

25 out of 10,000 hybridized with the probe. Restriction 

maps for these partially overlapping six clones are shown 
in Figure 1. 

In Figure 1, plasmids pABJ04-09, which all 
belong to locus 1, were isolated from a plasmid library 

30 using the 800 bp insert from Xgtll as a probe. The Xgtll 
insert (from locus 2) has an additional Sa.ll site not 
found in the plasmid clones. The position of the pllCl 
gene and direction of its transcription (indicated .by an 
arrow) were determined in E. coll minicells. Three 
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thickened lines, with an arrow-head at each end, indicate 
fragments used as probes in Southern hybridizations . 
I.e., the 800 bp insert from Xgtll, the jE?coRV 1 -BcoRV 2 
(1.3kb) and the EcoRV 3 -HindIII 4 (0.8 kb) fragments of 
5 pAGJ04. Triangles mark the location of two mTnCm 
insertions in pABJ04. The resulting plasmids, 
pABJ)4: :mTnCm-12 and pABJ04 : rmTnCm- 14 were used, were 
used to inactivate pilCl and pllC2. 

The six plasmid clones, pABJ04-09, were 

10 transformed into the minicell producing strain AA10 to 
monitor expression of plasmid encoded [ 3 5 S] methionine 
labeled proteins. The F. coll minicell strain AA10 was 
transformed with plasmid DNA (pABJ04-09) and chromosome 
deficient minicells from these strains were purified over 

15 sucrose gradients (Thompson and Achtman, 1978) . The 

plasmid- encoded proteins were labeled in the presence of 
80 pCl [ 3 5 S] methionine in minimal salts medium and 1% 
methionine assay medium (Dif co) . After lysis of the 
minicells in sample buffer (Laemmli, 1970) the proteins 

20 were electrophoresed on an SDS-polyacryl amide gel, the 

gel was dried and exposed to X-ray film (Kodak X-OmatAR) . 

Plasmid pABCT04 expressed minute amounts of 
three high molecular weight proteins, 113, 111 and 108 kd 
in size, as well as a number of lower molecular weight 

25 protein species not produced from the vector control. 
The three high molecular weight bands were missing in 
pABJOS and pABJ06 but three novel lower molecular weight 
protein species had appeared, suggesting that pABJOS and 
pABJ06 are deleted for the .3' end of a gene, denoted 

30 pilCl, and that this gene is responsible for all three 
high molecular weight species. This suggested that the 
distal end of the gene must be located between the Mlul^ 
and MuII 2 sites (Figure 1) . The observation that plasmid 
pABJ07 did not express any high molecular proteins 

35 
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tentatively located the 5' end of the gene to a region 
0.5-1.2 kb to the right of the EcoRV 3 site. The size for 
a gene encoding a 110 kd protein is ~3 kb which is in 
agreement with these mapping data. 

5 

Example 5 

Identification of a Second Gene Encoding PilC 

The 800 bp insert in Xgtll contains a single 
Sail site not present in the region on pABJ04 which 

10 hybridized to this fragment, suggesting that there is 

more than one pllC locus in the genome of N. gonorrhoeae 
MSll. This was confirmed in Southern blot hybridizations 
in which three different pllC fragments were used to 
probe SznaX and Clal digested genomic DNA. The 800 bp 

15 fragment from Xgtll hybridized in a Southern blot to two 
Clal (18 and 8 kb) and Smal (13 and 4.5 kb) fragments of 
DNA prepared from N. gonorrhoeae MSll^. Since the probe 
does not contain any internal Clal or Smal sites, there 
are presumably two copies of the 3 ' end of p±lC in the 

20 MSll genome. The 1.3 kb EcoRV^ - EcoRV 2 fragment of 

pABJ04 carries the central region of pilCl. This probe 
hybridized to the same two Clal fragments and to four 
Smal fragments, two of which are the same size as the two 
fimal fragments identified with the 800 bp probe (13 kb 

25 and 4.5 kb) . Hybridization with the 800 bp probe was 
more extensive to the 8 kb Clal and the 4 kb Smal 
fragment whereas the reverse was found with the 1.3 kb 
-BcoRV 1 - EcoRV 2 fragment from pABJ04 strongly suggesting 
that the two genomic copies of pllC show a significant 

30 sequence variation in the 3' as well as in the central 

region. A probe corresponding to the 5' region of pllCl 
was also used in Southern hybridization experiments. 
This 0.8 kb HindIII 4 - JEcoRV 3 fragment hybridized to two 
Clal (18 kb and 4 kb) and Smal (25 kb and 7 kb) fragments 
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with seemingly equal efficiency. The hybridization 
pattern was identical using DNA from MSII^. Taken 
together these hybridization data indicate that 
N. gonorrhoeae MS11 contains two complete copies of pilC. 
5 Furthermore the two genes appear to be more homologous in 
their 5' as compared to their central and 3' regions. 

The results indicate that the 800 bp insert 
from Xgtll carries information from pllC2 whereas the 
clones pABJ04-09 must carry information from pilCl. 

10 Finally p±lC2 must be located >2 kb from either end of 
pilCl. The DNA sequence of the 3' -end of the pilC2 
fragment is shown in Figure 7. The sequence showing the 
putative amino acids encoded therein are shown in Figure 
8. A comparison of the analogous portions of p±lC2 (top) 

15 and pilCl (bottom) DNA sequences, and the putative amino 
acids encoded therein are shown in Figure 9* 

The 800 bp fragment from p±lC2 was also used to 
probe digested genomic DNA from N. gonorrhoeae strains 
UM01, 765 and 605103. The latter isolate does not 

20 express detectable levels of the 110 kd protein. Strain 
UM01, unlike MS11, contained only one Clal fragment of 15 
kb that hybridized to the probe (data not shown) . Hence, 
this strain may contain only one copy of pile. Strain 
605103 and 765, on the other hand, each seem to contain 

25 two copies of pile since two Clal and two Sroal fragments 
hybridized to the 800 bp probe. 

The commensal N. lactamica Nctcl0618 DNA 
digested with Clal and Smal also hybridized with the 800 
bp probe. Since only one band hybridized in each case 

30 this strain may contain only one copy of pilC. In 

contrast, N. sub f lava GN01 did not hybridize to the 800 
bp pilC2 probe using the same stringency. 

35 
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Examole 6 
Characterization of the pile Genes 

The pilCl gene on pABi704 is translationally out of frame 
5 The amino terminal sequence of gel purified 110 

kd protein from strain MSll^ (P + ) was determined by 
sequential Edman degradation. For aminoterminal sequence 
determination automated Edman degradations (Edman and 
Bregg, 1967) were performed in an updated Beckman 890C 
10 spinning cup sequencing sequencer. The sequencing 

procedure and the method for analysis of the 3 -phenyl -2- 
thiohydantoin derivatives been described (Engstrom et 
al., 1984). Considerable difficulties were encountered 
in the method probably due to blocking of the N- terminus. 
15 As a result, only the residues from position 4 to 10 were 
obtained (Figure 2) . 

The 3.3 kb i2±adIII 4 - MluX^ fragment 
encompassing the entire pilCl gene was sequenced on both 
strands using the dideoxy sequencing method adapted for 
20 single stranded DNA. 

Purified DNA fragments from pABJ04 and PCR- 
amplified 5' end of pilCl was subcloned into M13 vectors 
(Sanger et al., 1980: Yanish-Perroa et al., 1985) and 
sequenced using the chain termination method of Sanger et 
25 al* (1977) * Primers used were the M13 17-mer universal 
primer and oligonucleotides synthesized at Symbicon, 
Umea, Sweden or at the Department of Biochemistry, 
Washington University, St., Louis, MO, USA. 

The results of the sequencing showed that the 
30 pilCl contained one single open reading frame of 997 

codons (from left to right in Figure 2) and starting at 
an AUG codon 195 bp from the HindXII 4 site. Codons 7-12 
in this open reading frame corresponded to amino acids 5- 
10 in the sequence of the gel purified protein. The AUG 
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codon in the beginning of the long open reading frame was 
not preceded by a typical Shine- Dal garno sequence. 
Moreover, since the 110 kd protein is located in the 
outer membrane of N. gonorrhoeae, we expected the protein 
5 to be translated with a signal sequence. When examining 
the nucleotide sequence, an AUG codon was found in frame 
1 that was preceded by a typical Shine -Dalgarno sequence 
( -AGGAA- ) . The sequence following this AUG codon would 
encode a typical signal peptide with basic amino acids in 

10 the amino terminal region and a hydrophobic central 

region. However, no signal peptidase cleavage site could 
be predicted following the rules of von Heijne (1983) . A 
tract of 12G residues was found in the region encoding 
the putative signal peptide for PilC. Addition of one G 

15 residue or the loss of two would align the long open 
reading frame with the AUG codon in frame 1* The 
translated region in frame 2 contains a putative signal 
peptidase cleavage site between Ala and Gin. A cleavage 
at this site would align the determined amino acid 

20 sequence at positions 5-10 for the 110 kd protein with 
the deduced amino acid sequence* The data therefore 
suggested that the cloned pilCl gene is out of frame due 
to f rameshif ting in the region encoding the signal 
peptide . 

25 Figure 2 shows the nucleotide sequence and the 

deduced amino acid sequence of the 5' -end of pile. The 
amino- terminal sequence of gel purified PilC from 
MSll mg (P + ) is shown in a box below frame 2, a 997 amino 
acid long open reading frame that would code for a 

30 protein about 110 kd in size. Frame 1 contains 41 amino 
acids and is preceded by a putative Shine -Dalgarno 
sequence (underlined) . Two horizontal lines mark a 
stretch of 12 G residues. An addition of one G in this 
region would align the ATG (boxed) in frame 1 with frame 
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2. Numbers above the sequence show base positions 
relative to the ffindIII 4 site (=0) located on pABJ04. 
The position of two 24 bp oligonucleotide primers 
(opposite stands) used for PCR amplification, are 
5 indicated above the sequence by hatched bars. 

Figure 3 shows the nucleotide sequence of the 
sense strand of the pilCl gene. 

Figure 4 shows the shows the nucleotide 
sequence of the sense strand of the pilCl gene and the 
10 amino acids encoded therein. 

Genetic inactivation of pilC2 but not pilCl 
abolishes expression of the 110 kd protein in MS11 

Plasmid pABJ04 was mutagenized in E. coll by a 

15 transposon mini-Tn3 derivative, mTnCm. The shuttle 

mutagenesis system developed by Seifert et al., (1986) 
using a miniTn3 carrying the chloramphenicol resistance 
gene was kindly provided by Dr. M. So. Mutagenesis of 
pABJ04 with mTnCm and transformation of N. gonorrhoeae 

20 were performed as previously described (Seifert et al., 
1990) . MiniTnCm insertions at 30 different positions in 
pABCT04 were identified, two of which mapped within the 
PilC gene. Piliated N. gonorrhoeae MS 11^ were 
transformed with 2 fig plasmid DNA, transf ormants were 

25 selected for on plates containing 10 fig/wl 

chloramphenicol for the single mutants and 30 fig/ml 
chloramphenicol for the double mutants. 

Only two mTnCm insertions had occurred in 
pilCl (Figure 1) . Truncated protein species were seen in 

30 minicells with the mTnCm-14 insertion located 0.5 kb from 
the 3' end of pilCl but not with the mTnCm-12 insertion 
located 0.5 kb from the 5' end of the gene. Both 
insertion mutants were used in a gene replacement 
experiment . Plasmids pABJ04: :mTnCm-12 and pAB J04 : rmTnCm- 
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14 were linearized with BamHI and transformed into 
N. gonorrhoeae MSll^CP" 1 ") and transf ormants resistant to 
10 /zg/ml of chloramphenicol were selected. Forty- eight 
P + transf ormants (24 from each experiment) were assayed 
5 for the presence of PilC in immunoblots . All these 
transf ormants remained capable of expressing the PilC 
protein. Genomic DNA was prepared from seven of the 
chloramphenicol resistant transf ormants (five from 
pABJ04: :mTnCm-12 and two from pABJ04 : :mTnCm-14) , cleaved 

10 with Clal and PvuII and used in Southern blot experiments 
using the EcoRV 1 - EcoRV 2 fragment of pABJ04 as a probe. 
The 8 kb Clal fragment was unaffected in the mutants 
whereas the 18 kb Clal fragment had been replaced by a 20 
kb fragment. PvuII cleaves within the 1.6 kb mTnCm 

15 element. The probe detected an 8 kb PvuII fragment in 
both parent and mutant DNA. In the mutants, a novel 
PvuII fragment appeared that was 6.2 kb in size in five 
transf ormants obtained with pABJ04 : :mTnCm-12 and 4.8 kb 
in size in two transf ormants with pABJ04 : :mTnCm-14. To 

20 confirm the insertion of mTnCm, a 250 bp EcoRI-Hindlll 
fragment of the CAT GenBlock (Pharmacia, Sweden) , 
containing the PvuII site, was used as a probe. It 
detected the larger of the two Clal fragments as well as 
the 6.2 kb PvuII fragment. In addition, a 2 kb PvuII 

25 fragment not covered with the pllC probe was detected. 
These data demonstrate that we have obtained gene 
replacements in pilCl, whereas p±lC2 was unaffected in 
all seven P + , PilC + transf ormants . A rapid hybridization 
was done to screen the remaining 41 P + transf ormants . 

30 All but one had mTnCm inserted in pilCT. The remaining 
trans formant had an intact locus 1 and 2 and must 
therefore contain mTnCm elsewhere in the gonococcal 
chromosome . 
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In the same transformation experiments, the 
frequency of P~ colony variants was about five- fold 
higher as compared with that occurring normally in strain 
MSll^CP*) . Two P'mTnCm-12 transf ormants isolated at 10 
5 /ig/ml of chloramphenicol were also analyzed by Southern 
blot hybridization using the EcoKV^ - EcoRv 2 fragment of 
pABJ04 and the EcoRI - Hindlll fragment of the CAT 
GenBlock. Each of these mutants carried mTnCm in pilC2 
as evidenced by a replacement of the 8 kb Clal fragment 
10 by a fragment 9.5 kb in size that hybridizes to both 
probes* These pll C2 : rmTnCm insertion mutants did not 
express PilC as determined by immunoblot analysis. 

A P + , pllClz rmTnCm- 12 mutant was retransf ormed 
with DNA prepared from a P", pilC2: :mTnCm-12 mutant and 
15 colonies growing at 30 /ig/ml of chloramphenicol were 

selected to obtain double mutants in pilC. All resistant 
transf ormants were P~, and when analyzed by Southern blot 
hybridization all contained mTnCm in both pllCl and 
pllC2. Electron microscopy revealed that the P + , 
20 pllClz zmTnCm- 12 mutant still expressed pili albeit at a 
slightly lower level than the MSll mk (P + ) parental clone, 
whereas the P" , pilC2z rmTnCm- 12 was completely bald as 
was the pilCl, pilC2 double mutant. 

Immunoblot analyses were performed on the P + 
25 pllClz rmTnCm- 12 mutant, the P", pilC2: rmTnCm- 12 mutant 
and the P", pllClz rmTnCm- 12, pilC2: rmTnCm- 12 double 
mutant, using PilC and pili antisera. Inactivation of 
pllCl did not abolish expression of PilC or the pilin, 
Inactivation of p±lC2 totally abolished expression of 
30 PilC but did not affect expression of pilin. The pllCl, 
p±lC2 double mutant was PilC" but produced only low 
levels of pilin. Taken together these data imply that 
p±lC2 but not pllCl is expressing PilC in the MS11 
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variant under study. Moreover, inactivation of p±lC2 but 
not pilCl was associated with a loss of piliation. 

P + revertants occurred spontaneously at a low 
frequency in the pllC2: :mTnCm- 12 mutants. These 
5 revertants expressed pili as determined by electron 

microscopy and also expressed PilC. It is likely that 
PilC expression is due to in- frame switching in pilCl. 

Figure 5 shows the nucleotide sequence of the 
sense strand of the pilCl gene, and the effect of frame 
10 shift on the putative gene products encoded therein. 

The pllC genes of N. gonorrhoeae vary 

in the length of the G tract 
Polymerase chain reaction (PCR) with Tag 
15 polymerase was used to analyze the 5' region of pile 

using two 24 base long synthetic oligonucleotides based 
on the sequence of pilCl (Figure 2) . These 
oligonucleotides would generate an amplified fragment of 
149 bases as judged from the sequence obtained from 
20 pABJ04. 

Polymerase chain reaction was carried out in 
100 fil containing 50 ng of genomic DNA or 5 ng of plasmid 
DNA. 1.0 fM of each oligonucleotide, 200 fiM of each 
nucleotide, 0.001% gelatin, 1.5 mM MgCl 2 , 10 mM Tris (pH 

25 8.3), 50 mM KC1, 0.25 fil 1 mCi/ml [ 32 P]dATP and 2 U of 
Taq Polymerase (Perkin Elmer Cetus) . The samples were 
passed through 25 cycles: 2 min at 50 °C, 1 min at 94 °C 
and 3 min at 72 °C in a Thermal Cycler (Perkin Elmer 
Cetus) . Aliquots of the DNA fragments were denatured at 

30 95 °C for 2 min and electrophoresed on standard denaturing 
sequencing gels. 

The amplified products from MSll^tP*) DNA were 
149 and 150 long respectively. In addition, two less 
abundant products of 151 and 148 bases were seen. The 
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amplif ied products were electroeluted and cloned into 
M13mpl8, and twenty phage clones were sequenced using a 
universal primer. Four different sequences were obtained 
(Figure 6) . 

5 Figure 6 shows the nucleotide sequence of PGR 

amplif ied fragments demonstrating a variation in the 
length of the G tract and sequence differences in the 5 ' - 
region of the pile genes. The two oligonucleotide 
primers used for the PCR are shown in Figure 2. 
10 Amplified DNA was cloned into M13mp8 and sequenced. 

Shown are the complete nucleotide sequence in between the 
two primers. In- frame sequences are translated and the G 
stretches are underlined. The putative cleavage sites 
are marked with arrows. Genomic DNA from N. gonorrheae 
15 strains MS11 (P + .PilC + ) , UM01 (P + .PilC + ) , 765 (P + .PilC + ) 
and 605103 (P'n.PilC"), and purified DNA from 
PABJ04/AA10 (recA) was used in the PCR. 

Variant patterns la and lb were identical to 
each other and to the cloned sequence on pABJ04 except 
20 for the presence of 11 instead of 12 G residues in the G 
tract of lb. The G tract of sequence 2a was 13 residues 
long indicating that the sequence is in frame. In 
addition, this sequence differed from pilCl by four 
basepair substitutions outside the G tract, including an 
25 AAA lysine codon four triplets downstream of the putative 
signal peptide processing site which is in agreement with 
the lysine residue found in the fourth position of the 
gel purified 110 kd PilC protein. Sequences la and lb 
contained CAA, the codon for Gin, at the same position. 
30 Sequence 2b was identical to 2a except for the presence 
of 12 G residues in the G tract. These data are 
compatible with sequence 1 being from pilCl and sequence 
2 from pilC2 and further support that pilC2 must be the 
expressed locus in the MS11 (P + ) variant we are studying. 
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S train UM01 apparently only contains one copy 
of pllC. DNA from this strain generated five amplified 
fragments ranging in size from 148 to 152 bp in the PCR 
reaction. The most abundant fragments were 149-151 bp 
5 long. Among ten M13 clones, three variant 1 sequences 
were found (a,b,c) that differed only in the number of G 
residues (11-13) in the G tract (Figure 6) supporting the 
hybridization data that this strain contains only one 
pilC gene. Since a PilC protein is expressed from UM01 
10 we suggest that the majority of cells has 13 Gs in the G 
tract . 

Strain 765 contains two pilC loci, both of 
which seem to be translationally ON based on the presence 
of two high molecular weight proteins reacting with the 

15 absorbed PilC antiserum. A number of amplified fragments 
were seen after the PCR reaction ranging in size from 149 
to 153 bases. Three variant sequences were found among 
nine clones (Figure 6) . The G tract was 13 residues long 
in variant 3a (in frame) and 14 (out of frame) in variant 

20 3b whereas sequence variant 4 contained 11 G residues in 
the G tract. Variant sequence 4 contained four 
additional nucleotides (-CAGG-) distal to the G tract 
relative to variant sequences 1, 2 and 3, indicating that 
the amplified product with 11 Gs from this variant 

25 sequence is 152 long and out of frame. Two PCR amplified 
products 152 and 153 in length were obtained from strain 
765 suggesting that in frame variants of sequence 4 might 
be present in the DNA prepared from this strain. 

Strain 605103 carries two pilC copies, both of 

30 which seem to be translationally OFF. The amplified 

fragments were 148 and 149 bases in size. Out of eight 
M13 clones only variant la and lb sequences were found, 
with 11 and 12 Gs in the G tract respectively. 
Consequently, we were unable to find an in frame sequence 
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variant from this strain. We do not know if the 5' ends 
of the two pilC genes are identical in this strain or if 
one pilC gene differed from pilCl in the region 
corresponding to the oligonucleotides used for 
5 amplification. In the latter case we would not expect to 
obtain any amplified products from the second copy. 

The only in frame variant found in DNA 
amplified from N. gonorrhoeae carried 13 6s in the 6 
tract. To see if variants with 106s arise in products 
10 expressed in E. coli, PCR amplified products were 

generated from pABJ04 purified from E. coli strain AA10, 
using the same two oligonucleotide primers as before. 
Out of 12 sequenced clones, two carried 10 Gs in the 6 
tract (Figure 6) . The majority of clones (seven) carried 
15 12 6s as expected. It is likely that the PCR 

amplification products are not representative of the 
original DNA population. However, the distribution of 
variation in the 6 tract is consistent with a model in 
which only one 6 residue is gained or lost at one given 
20 event. Since AA10 is recA, frameshift mutations in the 6 
tract in E. coli occur independently of the RecA protein. 

N. meningitidis contains two pilC loci 
Southern blot hybridizations using MS11 pilCl' specific 

25 probes identified multiple fragments when meningococcal 
genomic DNA is digested with a variety of restriction 
endonucleases . PCR amplification using two 24-base 
oligonucleotides from the 5' end of MS11 pilCl as primers 
yields multiple fragments ranging in size from 148 to 151 

30 bases. DNA sequencing of fragments cloned into phage M13 
identifies two classes of sequences, as in N. 
gonorrhoeae, which differ outside the 6- tract. Variation 
occurred within each class with respect to the number of 
G's in the 6- tract. Therefore, N. meningitidis must 
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carry two pilC loci, as does N. gonorrhoeae, which should 
also be under the control of translational frame 
shifting. 

Both pile loci are cloned from N. meningitidis 
5 by generating an EMBL3 library and screening this library 
with pilCl' and pilC2~ specific DNA from N. gonorrhoeae 
strain MS11. 

Genomic DNA from W. meningitidis is partially 
digested with Sau3A and fragments ranging from 9 to 20 kB 

10 are ligated into the lambda EMBL3 vector. Because of the 
packaging constraints of the phage, only those lambdas 
which contain DNA fragments of this size will be packaged 
(i.e., are viable). The library thus constructed can be 
screened with genomic oligonucleotide or cloned gene 

15 probes following selection in a lysogenic E. coli strain. 
(Frischart, A.M. et al (1983), J. Mol. Biol. 170:827). 
Preferably, full length clones are identified by 
screening for clones hybridizing to both the 5 ' and 3 ' 
ends of pilC. If full length clones cannot be obtained 

20 from the EMBL3 library, pile specific probes may be used 
to screen a plasmid library from the same strains. 

Translational fusion proteins with /S- 
galactosidase may also be screened for in a Xgtll 
library, using 0-galactosidase and PilC" specific 

25 antisera in Western immunoblots. /3-galactosidase-PilCl 
and 0-galactosidase-PilC2 fusion proteins are purified 
from the cytoplasm of recombinant E. coli and used to 
raise specific antisera. 



3 0 Example 7 

The immunobioloaical properties of PilC 
PilC is located in the outer membrane of 
Neisseria. The immune response during natural infection 
can be assessed by screening convalescent sera for anti- 

35 
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PilC antibody. The presence of only two pllC loci 
suggests that PilC is only moderately variable, however. 
This together with its essential role in pilus biogenesis 
makes PilC attractive as a potential vaccine candidate. 
5 Two types of PilC translational fusions using 

alkaline phosphatase and /J-galactosidase are generated. 
In the first instance a secreted fusion protein is 
obtained that may associate with the outer membrane. In 
the second instance the fusion proteins may accumulate in 
10 the cytoplasm as inclusion bodies. The construction 

schemes for such fusion proteins uses techniques known in 
the art. TnpiioA insertions on plasmid pABJ04 in E. coll 
are generated, and a PhoA + phenotype is screened for as 
blue colonies on media containing the chromogenic 
15 substrate XP. If such clones have the phoA gene in frame 
with an in frame variant of pilCl the fusion product 
should be able to cross the cytoplasmic membrane where it 
can be analyzed by Western immunoblots using an alkaline 
phosphatase specific antiserum and our PilC antiserum 
20 raised against gel-purified PilC2 from MS11(P + ) . 

LacZ: zpllC fusions are generated by cloning different 
segments of pllC into a la.cZ' containing vector used to 
generate translational fusions. Similar constructs are 
performed on each of the two pllC genes from N. 
25 meningitidis. Antisera are generated against fusion 
proteins after their purification using conventional 
protocols. These antisera are extensively adsorbed with 
extracts of E. coll expressing alkaline phosphatase and 
/S-galactosidase, and used in Western immunoblots and 
30 ELISA assays against a panel of Neisseria gonorrhoeae and 
Neisseria meningi tldls strains. Antisera raised against 
fusion proteins carrying the major portion of PilC are 
also analyzed in Western blots using E. coll expressing 
fusion proteins containing only smaller regions of PilC. 
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The results of these studies should show which regions in 
PilC are immunodominant* 

The pllCl and pilC2 genes are highly homologous in their 
5' ends whereas the homology is considerably less 
5 pronounced in the central and 3' region. 

In addition, the entire pilC2 gene from N. 
gonorrhoeae MS11(P + ) is cloned and sequenced. Algorithms 
are used to search for potential T-cell epitopes 
{amphipathic helical conformation) and 0-cell epitopes. 

10 Polypeptides containing the predicted epitopes are tested 
to determine if they can prime mice for an enhanced 
immune response to PilCl and PilC2 . 

Specific PilC antisera are used in 
immunoelectromicroscopy with piliated Neisseria cells as 

15 well as with purified pili to see if PilC is physically 
connected with the pilus fiber. 

Neisseria is grown in the presence of different 
dilutions of PilC" specific antibodies. Bactericidal 
effect exerted by the antiserum, effects on piliation, 

20 and effects on bacterial attachment to corneal primary 
culture cells are monitored. Binding assays to 
epithelial cells are described in Tjia, K.F. et al. 
(1988), Graefe's Arch. Clin. Exp. Opthalmol. 226:341- 
345. 

25 

Example 8 

Identification and characterization of genes 
looted ^qjacent tp pilQ 

30 The pilCl and pilC2 loci are part of a larger duplication 
that extends both 5'- and 3'- of pilC. We know from our 
work with E. coli that strains may contain multiple gene 
clusters for the same class of pili. In one case we have 
shown that the only difference between two duplicated 
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gene clusters (pap and prs) resides in the adhesion genes 
such that each cluster gives rise to serologically 
identical pili binding to different cell surface 
receptors . 

5 mTncm mutagenesis in the region upstream and 

downstream of pilCl is performed to generate allelic 
replacements in the pxlCl and p±lC2 regions on the 
chromosome. Since the two regions are highly homologous 
we expect to obtain for each insertion allelic 

10 replacements in either region. Double mutants are 
generated as before by isolating DNA from mutants 
carrying insertions in the p±lC2 region transforming P 
variants carrying the same insertion in the pilCl region 
and select for transformants resistant to 30/ig/ml of 

15 chloramphenicol . These double mutants are examined for 
piliation, pilins expression, and binding to corneal 
primary culture cells. 

Example ? 

20 Phase variation in gonococcal pili expression 

can be caused by fram eshif t mutations in pile 
If PilC is required for pilus formation, we 
would expect some P" progeny arising from a P + clone to 
accumulate unassembled pilin in the absence of PilC. 

25 Nonpiliated (P") colonies were derived from MSllj^fP*}, 

restreaked, and tested for the presence of PilC and pilin 
in immunoblots with the PilC and pili antisera. Five out 
of eight P~ clones did not produce detectable levels of 
PilC, but expressed the pilin subunit. The remaining 

30 three P" clones expressed PilC but not pilin. The 

molecular mass of the pilin subunit was the same in the 
P~, PilC" variants as in MSll^fP*^, PilC + ) . However, the 
former in addition produced a protein reacting with the 
pili antiserum that was 16 kd in size. Since MSll^ only 
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contains one expression site for pilin we believe that 
this protein species represents a proteolytic degradation 
product of the pilin and may be identical to the S -pilin 
previously described (Haas et al., 1987). Several 
5 independent P" clones were isolated from one P" , PilC" 
clone. They all remained PilC" and retained expression 
of pilin. Piliated (P + ) revertants were also obtained 
from the same P" , PilC" clone. These P + revertants 
occurred at about a tenfold lower frequency (10 ) than 

10 P" derivatives from a P + clone. All P + revertants from a 
P" , PilC" clone had regained expression of PilC. All but 
one expressed a pilin with the same molecular weight as 
the nonpiliated parent. However, the low molecular 
weight pilin degradation product was much less abundant 

15 in the P + , PilC" revertants. It was possible to obtain 
P + revertants from other P", PilC" clones as well, all of 
which expressed PilC. 

The pllE gene from one set of PilC switches was 
PCR amplified and sequenced directly. The P" , PilC", 

20 pilin + variant 8 carried eight amino acid changes in the 
pilin relative to the parental clone MSll^^ The pilin 
sequence of the P + , PilC" backswitcher 8:1 was identical 
to variant 8. Thus, the backswitching from P"to P + 
colony morphology was not associated with any alteration 

25 in the pilus subunit protein implying that the change in 
colonial morphology was due to the switch in PilC 
expression. 

Strain MSll^tP") , variants 8(P") and 8:1 (P + ) 
were also examined by transmission electron microscopy. 
30 Electron microscopy was performed with a JEOL 100CX 

microscope with 200 -mesh copper grids coated with thin 
films of 2% Formvar. The bacterial colonies were 
carefully overlaid with buffer [10 fM Tris-HCl (pH 7.5), 
10 /zM magnesium chloride] and the cells were allowed to 
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sediment for 15 min on a grid. The grids were washed 
with water, negatively stained with 1% sodium 
silicotungstate (pH 7.0) and then washed again. 

The electron micrographs showed that the 
5 MS11 (P + ) parental cells were heavily piliated and pili 
were of ten seen to aggregate. In contrast most cells of 
variant 8(P~) were nonpiliated. One or two pili were 
found on -10% of these cells. All cells of variant 8:1 
(P + ) were piliated, carrying -10-40 fibers/cells. No 

10 aggregation of individual fibers was seen. These data 

confirm that the observed changes in colonial morphology 
reflect alterations in expression of pili. Therefore, 
phase variation of gonococcal pili may not only be caused 
by recombination events occurring in the pllE locus 

15 (Bergstrom et al, .1986; Swanson et al., 1986) but also by 
frameshif t mutations in pile. 

Example 10 
Immunoaenicity of PilC 

20 In order to predict a region of PilC which 

would have a high probability for antigenicity, residues 
300 to 700 of the putative PilCl protein encoded within 
pllCl were analyzed for antigen index, hydrophilicity, 
and hydrophobicity using standard computer -modelling 

25 methods. The analysis indicated that the PilCl 

polypeptide fragment containing residues 300 to 700 would 
have several regions with a high antigen index, high 
hydrophilicity, and a high likelihood for location in an 
external domain. 

30 The immunogenicity of a recombinant polypeptide 

expressed from the DNA encoding amino acids 300 to 700 
was examined. The region of DNA encoding amino acid 
residues 300 to 700 was amplified by polymerase chain 

35 
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reaction (PGR) using the following oligonucleotide 
primers . 

5' GGC TAG GTG GCA TAT GAA AGA TAC CGG 3' 
5 and 

5' TTT GCA ATC GG£ GAT CCT * C*A*G GTG TCT TTC 3' 

These primers incorporate an Ndel and a BamHI restriction 
endonuclease site (indicated by the underlined 

10 nucleotides) , respectively. A termination codon 

(indicated by the asterisks) was also incorporated. The 
PCR amplified DNA was then ligated into the vector pET3a 
(between the Ndel and BamHI sites) . The recombinant 
vector pET3a is used in the inducible expression system 

15 described by Studier et al. (1990) , using the protocol 
described therein. Strain BL21 (DE3) was transformed 
with the pET3a-pi2C (300-700) vector, and the transformed 
strain used for the expression of the PilC (300-700) 
peptide. 

20 The expression products after induction were 

analyzed by sodium dodecyl sulf ate-polyacrylamide gel 
electrophoresis (PAGE) using an 8% acrylamide gel and the 
standard 25 znM Tris base / 250 mM glycine / pH 8.3 / 0.1% 
sodium dodecyl sulfate (SDS) electrophoresis buffer. 

25 After electrophoresis , the gel was fixed and then stained 
with Coomassie Blue according to standard protocols, and 
the production of PilC (300-700) was confirmed by the 
detection of the presence in the gels of an abundant, 
appropriately- sized peptide of approximately 46kD. 

30 In order to detect the immunogenicity of the 

PilC (300-700) product, the region of the SDS -PAGE gel 
containing the PilC (300- 700) polypeptide was excised from 
parallel unstained lanes, homogenized, and the protein 
eluted into a buffer of 0.1% Triton X-100 in water by 

35 
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passive diffusion. Rabbits were given a priming 
intradermal injection of homogenized gel slices 
(containing approximately 500/ig PilC (300-700) protein), 
followed 3 weeks later by a subcutaneous boost (of 
5 approximately 500 fig of eluted protein) . An initial test 
serum was then collected after an additional 14 days. 
All of the test animals yielded a specific high titer 
antibody response. The antibodies induced by PilC (300- 
700) were immunologically reactive not only with that 
10 polypeptide (i.e., PilCl (300-700) ) , but also with native 
PilCl and native PilC2. 

The results demonstrate, inter alia, the 
following. PilC contains antigenic epitopes that can 
elicit a strong immunogenic response. At least some of 
15 the immunogenic epitopes are shared (cross -reactive) 

between PilCl and PilC2, despite differences in primary 
amino acid sequence. The technique of subcloning 
discrete portions of the PilC protein under control of an 
inducible promoter allows mapping of antigenic epitopes. 
20 Sufficient quantities of specific oligopeptides of known 
antigenicity can be produced for use in screening the in 
vivo immune response after exposure to the intact 
pathogen. 

The following listed materials are on deposit 
25 under the terms of the Budapest Treaty with the American 
Type Culture Collection (ATCC) , 12301 Parklawn Dr., 
Rockville, Maryland 20852, and have been assigned the 
following Accession Numbers. 

30 Description ATCC No. Deposit Date 

pABJ03 in E~ coli (DH5) 68519 Jan. 28, 1991 

pABJ04 in E. coli (DH5) 68520 Jan. 28, 1991 
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Upon allowance and issuance of this application 
as a United States Patent, all restriction on 
availability of these deposits will be irrevocably 
removed; and access to the designated deposits will be 
5 available during pendency of the above-named application 
to one determined by the Commissioner to be entitled 
thereto under 37 CFR 1.14 and 35 USC 1.22. Moreover, the 
designated deposits will be maintained for a period of 
thirty (30) years from the date of deposit, or for five 

10 (5) years after the last request for the deposit; or for 
the enforceable life of the U.S. patent, whichever is 
longer. The deposited materials mentioned herein are 
intended for convenience only, and are not required to 
practice the present invention in view of the 

15 descriptions herein, and in addition these materials are 
incorporated herein by reference. 

Industrial Applicability 
The invention, in the various manifestations 

20 disclosed herein, has many industrial uses, some of which 
are the following. The pilC DNAs may be used for the 
design of probes for the detection of pilC nucleic acids 
in samples . The probes derived from the DNAs may be used 
to detect pilC nucleic acids in, for example, chemical 

25 synthetic reactions. The polynucleotide probes are also 
useful in detecting viral nucleic acids in humans, and 
thus, may serve as a basis for diagnosis of pathogenic 
microorganisms containing type 4 pilin, for example, 
gonococcal and/or meningococcal infections in humans. 

30 In addition to the above, the DNAs provided 

herein provide information and a means for synthesizing 
polypeptides containing epitopes of PilC. These 
polypeptides are useful in detecting antibodies to PilC 
antigens. A series of immunoassays the relevant 
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neisserial infection, based on recombinant polypeptides 
containing pilC epitopes are described herein, and will 
find commercial use in diagnosing diseases caused by 
these microorganisms. In addition, the polypeptides 
5 derived from the pilC DNAs disclosed herein will have 

utility as vaccines for treatment of infections caused by 
meningococci and gonococci. 

The polypeptides derived from the pilC DNAs, 
besides the above stated uses, are also useful for 

10 raising anti-PilC antibodies. Thus, they may be used in 
vaccines against the relevant microorganisms. Moreover, 
the antibodies produced as a result of immunization with 
the polypeptides containing an immunoreactive PilC 
epitope are also useful as passive vaccines, or in the 

15 detection of the presence of PilC antigens in samples. 
Thus, they may be used to assay the production of 
polypeptides derived from PilC in chemical systems. The 
anti-PilC antibodies may also be used to monitor the 
efficacy of anti-neisserial agents in screening programs 

20 where these agents are tested in tissue culture systems. 
Another important use for anti-PilC antibodies is in 
affinity chromatography for the purification of PilC 
derived polypeptides. The purified PilC polypeptide 
preparations may be used in vaccines. 

25 For convenience, the anti-PilC antibodies and 

polypeptides containing regions encoded in pilC, whether 
natural or recombinant, may be packaged into kits. 
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CLAIMS 

1. A recombinant polynucleotide encoding a 
polypeptide comprised of an immunoreactive epitope of a 
protein encoded in pilC of Neisseria.. 

2. The recombinant polynucleotide of claim 1, 
wherein the immunoreactive epitope is encoded in pilC of 
Neisseria gonorrheae. 

3. The recombinant polynucleotide of claim 2, 
wherein the immunoreactive epitope is encoded in pilC2 of 
Neisseria gonorrheae . 

15 4. The recombinant polynucleotide of claim 2, 

wherein the immunoreactive epitope is encoded in pilCl of 
Neisseria gonorrheae. 

5. A vector comprised of a recombinant 

20 polynucleotide, wherein the recombinant polynucleotide is 
selected from the group consisting of the recombinant 
polynucleotide of claim 1, the recombinant polynucleotide 
of claim 2 , the recombinant polynucleotide of claim 3 , 
and the recombinant polynucleotide of claim 4 . 

25 

6. A host cell transformed with the vector of 

claim 5 • 

7. A recombinant expression system comprising 
3 0 a polynucleotide encoding a polypeptide comprised of an 

immunoreactive epitope of a protein encoded in pilC of 
Neisseria, wherein the polynucleotide is operably linked 
to a control sequence compatible with a desired host. 

35 
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8, The recombinant expression system of claim 

7 , wherein the immunoreactive epitope is encoded in pilC 
of Neisseria, gonorrheae. 

5 9 . The recombinant expression system of claim 

8, wherein the immunoreactive epitope is encoded in pilCl 
of Neisseria gonorrheae* 

10* The recombinant expression system of claim 
10 8, wherein the immunoreactive epitope is encoded in pilC2 
of Neisseria gonorrheae. 

11. A cell transformed with a recombinant 
expression system, wherein the expression system is 
15 selected from the recombinant expression system of claim 
7, the recombinant expression system of claim 8, the 
recombinant expression system of claim 9, and the 
recombinant expression system of claim 10. 

20 12. A polypeptide produced by the cell of 

claim 11. 

13. A purified polypeptide comprised of an 
immunoreactive epitope of a protein encoded in pile of 

25 Neisseria. 

14. A purified polypeptide according to claim 
13, wherein the immunoreactive epitope is encoded in pilC 
of Neisseria gonorrheae. 



30 



15. A purified polypeptide according to claim 
14, wherein the immunoreactive epitope is encoded in 
pilC2 of Neisseria gonorrheae. 



35 
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16. A purified polypeptide according to claim 
14, wherein the immunoreactive epitope is encoded in 
pilCl of Neisseria, gonorrheae . 

5 17. A recombinant polypeptide comprised of an 

immunoreactive epitope of a protein encoded in pilC of 
Neisseria. 

18 . A recombinant polypeptide according to 
10 claim 17, wherein the immunoreactive epitope is encoded 

in pilC of Neisseria gonorrheae . 

19 . A recombinant polypeptide according to 
claim 18, wherein the immunoreactive epitope is encoded 

15 in pilC2 of Neisseria gonorrheae. 

20. A recombinant polypeptide according to 
claim 18, wherein the immunoreactive epitope is encoded 
in pilCl of Neisseria gonorrheae. 

20 

21. A method of preparing a recombinant 
polypeptide comprised of an immunoreactive epitope of a 
protein encoded in pilC of Neisseria, the method 
comprising: 

25 a. providing a host cell according to claim 

11; 

b. incubating the cell under conditions which 
allow expression of the recombinant polypeptide; and 

c. isolating the polypeptide. 



30 
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22 . A vaccine composition for the treatment of 
Neisseria infection, comprised of a pharmaceutically 
acceptable excipient and of an effective amount of a 
recombinant polypeptide, wherein the polypeptide is 
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comprised of an immunoreactive epitope of a protein 
encoded in pllC of Neisseria. 

23- The vaccine composition of claim 22, 
5 wherein the immunoreactive epitope is encoded in pllC of 
Neisseria gonorrheae. 

24. A polypeptide affixed to a solid 
substrate, wherein the polypeptide is selected from the 

10 group consisting of the polypeptide of claim 12, the 

polypeptide of claim 13, the polypeptide of claim 14, the 
polypeptide of claim 15, the polypeptide of claim 16, the 
polypeptide of claim 17, the polypeptide of claim 18, the 
polypeptide of claim 19, and the polypeptide of claim 20. 

15 

25. An immunoassay for detection of anti- 
Neisseria antibodies comprising: 

(a) providing a sample suspected of containing 
ant ±- Neisseria antibodies; 

20 (b) providing an antigen, wherein the antigen 

is a polypeptide selected from the group consisting of 
the polypeptide of claim 12, the polypeptide of claim 13, 
the polypeptide of claim 14, the polypeptide of claim 15, 
the polypeptide of claim 16, the polypeptide of claim 17, 

25 the polypeptide of claim 18, the polypeptide of claim 19, 
the polypeptide of claim 20; and 

(c) incubating the sample of (a) with the 
antigen of (b) under conditions which allow the formation 
of antibody- antigen complexes; and 

30 (d) detecting the presence of anti- Neisseria 

antibody- antigen complexes formed in (c) , if any. 

26. A composition comprised of a polypeptide, 
wherein the polypeptide is selected from the group 
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consisting of the polypeptide of claim 17, the 
polypeptide of claim 18 , the polypeptide of claim 19 , and 
the polypeptide of claim 20. 

5 27. A composition comprised of purified 

polyclonal anti-PilC antibodies, wherein the PilC is of 
Neisseria. 

28. A composition comprised of a monoclonal 
10 antibody directed against an immunoreactive epitope 

encoded in pilC of Neisseria. 

29. An immunoassay for detection of an antigen 
encoded in pilC of Neisseria comprising: 

15 (a) providing a sample suspected of containing 

an antigen encoded in pilC of Neisseria; 

(b) providing a composition comprised of 
antibodies directed against the antigen encoded in pilC 
of Neisseria, wherein the composition is selected from 

20 the group of compositions of claim 27 and claim 28; 

(c) reacting the sample of (a) with the 
antibody containing composition of (b) under conditions 
which allow the formation of anti-PilC antibody -antigen 
complexes ; 

25 (d) detecting anti-PilC antibody- antigen 

complexes formed in (c) , if any. 

30. A kit for analyzing samples for the 
presence of anti-PilC antibodies comprising: 

30 (a) an antigen packaged in a suitable 

container, wherein the antigen is a polypeptide selected 

from the group consisting of the polypeptide of claim 12, 

the polypeptide of claim 13, the polypeptide of claim 14, 

the polypeptide of claim 15, the polypeptide of claim 16, 

35 
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the polypeptide of claim 17 , the polypeptide of claim 18, 
the polypeptide of claim 19 , and the polypeptide of claim 
20; 

(b) a buffer used in the performance of the 
5 analysis, packaged in a suitable container; and 

(c) instructions on the performance of the 
analysis which uses the antigen of (a) and the buffer of 
(b) . 

10 31. A kit for analyzing samples for the 

presence of an antigen comprised of an immunoreactive 
epitope encoded in pile of Neisseria comprising: 

(a) a composition comprised of antibodies 
directed against the antigen comprised of an 

15 immunoreactive epitope encoded in pile of Neisseria, 
wherein the composition is selected from the group of 
compositions of claim 27 and claim 28, wherein the 
composition is packaged in a suitable container; 

(b) a buffer used in the performance of the 
20 analysis, packaged in a suitable container; and 

(c) instructions for performing the analysis. 

32. A method for producing antibodies to PilC 
of Neisseria comprising administering to an individual a 
25 composition comprised of an isolated immunogenic 

polypeptide containing a PilC epitope in an amount 
sufficient to produce an immune response to the PilC 
epitope. 

30 33. An oligomer capable of .hybridizing to a 

sequence in pile of Neisseria, wherein the oligomer is 
comprised of a pile sequence complementary to at least 
about 6 contiguous nucleotides of pile. 
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34. An oligomer according to claim 33, wherein 
pilC is of Neisseria gonorrheae . 

35. A process for detecting a pilC sequence in 
5 an analyte strand, wherein the pilC sequence comprises a 

selected target region, the process comprising: 

(a) providing a sample comprised of an analyte 
strand suspected of containing a selected target pilC 
sequence; 

10 (b) providing an oligomer capable of 

hybridizing to the target pilC sequence, wherein the 
oligomer is comprised of a pilC targeting sequence 
complementary to at least about 6 contiguous nucleotides 
of pilC; 

15 (c) incubating the sample of (a) with the 

oligomer of (b) under conditions which allow specific 
hybrid duplexes to form between the targeting sequence 
and the target sequence; and 

(d) detecting hybrids formed between the 

20 target sequence, if any, and the oligomer. 

36. The process of claim 35 which further 

comprises : 

(a) providing a set of oligomers which are 

25 primers for a polymerase chain reaction (PCR) method and 
which flank the target region; and 

(b) amplifying the target region via the PCR 

method. 

3 0 37. A kit for detecting a pilC sequence in an 

analyte strand comprising: 

(a) the oligomer of claim 33, packaged in a 
suitable container; 

35 
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(b) a suitable buffer, packaged in a suitable 



container ; and 



(c) instructions for performing the detection. 



5 



38. A recombinant polynucleotide comprising a 



DNA sequence of at least 8 contiguous nucleotides from 
pile, wherein the pile sequence is selected from the 
group of sequences shown in Figure 3, Figure 6, and 
Figure 7. 



10 



39. A method of treating an individual for a 



Neisseria infection comprising administering to the 
individual antibodies produced according to claim 32, 
wherein the antibodies are administered in an amount 
IS effective to prevent the pathology of the infection. . 

40. An immunoassay for detection of anti- 
Neisseria antibodies comprising: 

(a) providing a sample suspected of containing 
20 anti-tfeisseria antibodies; 

(b) providing an antigen, wherein the antigen 
is the polypeptide of claim 24; 

(c) incubating the sample of (a) with the 
antigen of (b) under conditions which allow the formation 

25 of antibody-antigen complexes; and 

(d) detecting the presence of anti-tfeisseria 
antibody-antigen complexes formed in (c) , if any. 



30 



41. A kit for analyzing samples for the 
presence of anti-PilC antibodies comprising: 

(a) an antigen packaged in a suitable 



container , wherein the antigen is a polypeptide from 
claim 24; 
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(b) a buffer used in the performance of the 
anaylsis, packaged in a suitable container; and 

(c) instructions on the performance of the 
analysis which uses the antigen of (a) and the buffer of 

5 (b). 
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GA7CCGCCCGG7GC77GGGGCCC77AGGGAACCG77CCC777GAGCCGGGGCGGGGCAAC 

70 90 110 

GCG7ACCGG777TTG77AA7CCGC7A7AAAAGGCGGGC7A7AGGG7AGGCTTCATCCTG 



CCAATCTCACTGAATCCGTCAATTTCCGCAATTCAATTAAATACCGTCAAACCGATGCCG 

190 210 230 

TCA77CCGCGCAGGCGGGAA7CCGGACC6G7CGGGCATC7GCGGCGG777GC7AAAAAAC 

250 270 290 

GC777ACCG7GA7AAG7GCGCA&AG77AAAA7GGGGAGG7AAGC7777CAATCAGCAA7C 

310 330 350 

CG0CGGGCGCGGAA7CGGGCGG777ACCGAACCCCGGCG77CGCGGCGCCCG7CCCGCGA 

370 390 410 

AGGCAAAC7XAAGGAA7AAAA7A7GAATAAAAC777GAAACGGCAGGT777CCGCCA7AC 

430 450 470 

CGCGC777ATGCCGCCATC77GA7G7777CCCA7ACCGGCGGGGGGGGGGGGCGA7GGCG 

490 510 530 

CAAACCCATCAA7ACGCTA77A7CA7GAACGAGCGAAACCAGCCCGAGG7AAAGCAGAA7 

550 570 590 

GTGCCA7CTTCAATJUUU3GACAAAGACAGGAGGCGCGAATA7ACT7A7TATACGCACAGA 

610 630 650 

ACAGGAG6AGGC7C7G7C7CA77CAACAA7AACGA7ACCC77G777CCCAACAAAGCGG7 

670 690 * 710 

AC7GCCG77777GGCACAGCCACC7ACC7GCCGCCC7ACGGCAAGG777CCGG7TT7GA7 

730 750 770 

GCCG7CGC7CTGAAAGAGCGCAACAA7GCCG77GA77GGA77CG7ACCACCCGCATCGCG 

790 610 830 

C76GCAGGC7AC7CC7ACA7CGACG7CA7A7GCAGAAGC7ACACAGGC7GTCCCAAAC77 

850 870 890 

G7C7A7AAAACCCGA777ACC77CGG7CAACAAGGG77GAAAAGAAAGGCAGGCAGCAAG 

910 930 950 

C7GGATATATACGAAGACAAAAGCCaCOAAAA77CaCCCA7T7ACAAA77Q7CGOAT7AT 

970 990 1010 

CC77GG77GGGCG7ATC777CAA777GGGCAGCGAGAATACCG7CCAAAAXAGCAAA77A 

1030 1050 1070 

7TCAACAAAT7GATATC77C7777AGAGAAGGCAA7AA7AA7CAAACCA7CGTCTC7ACG 

1090 1110 1130 

ACAGAAGGCAACCC7A777CCC77GGCGACCGGCAGCGCGAACATACCGCCG7GGCCTAT 

1150 1170 1190 

7A7C7GAACGCCAAAC7GCACC7GC7GGACAAAAAAGGGA77GAAGA7A7CGCCCAAGGC 

1210 1230 1250 

AAAA7AG7GGA777GGG7A7C77GAAACCGCACG7CGAGACGACAGGACGAAGCT7GC7A 

1270 1290 1310 

GA77777GGGC7AGG7GGGACA77AAAGA7ACCGGGCAGA77CCGG7CAAGC7CGGCCTG 

1330 1350 1370 

CCGCAAGTCAAAGCAGGCCGC7GCACCAACAAACCGAACCCCAA7AA7AA7ACCAAAGCC 

1390 1410 1430 

CC77CGCC0GCAC7GACCGCCCCCGCGCTGTGGT7CGGACCCGGGCAAGATGG7AAGGCG 

1450 1470 1490 

GAGATG7A77CCGC77CGG777CCACC7ACCCCGACAG77CGAGCAGCCGCA7C77CC7C 

1510 1530 1550 

CAAGAGC7GAAAAC7CAAACCGAACCCGGCAAACCCGGCCGC7ATTCCC7CAAATC777G 

1570 1590 1610 

AATGATGG7GAGA77AAAAG7CGACAGCCGAGTT7CAACGGGCGGCAAACAATCA7CCGA 

1630 1650 1670 

77GGATGACGGCG7ACA777GA7CAAAC7GAA7GGAAGCAAGGA7GAGGXCGCCGC7777 

1690 1710 3730 

G7CAA777AAA7GGAAACAACACCGGCAAAAACGACAC777CGGCAT7G7TAAGGAAGCG 

1750 1770 1790 

AACG7CAA7C77GACGCCGACGAG7GGAAAAAAG7GC7GC7GCCT7GGACGGTTCGGGG7 

1010 1830 18S0 

CCCGA7AATGACAA7AAA7X7AAA7CAATXAACCAAAAACCAGAAAAA7ACAGCCAAAGA 
1870 1890 1910 
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TACCGCAxCCGCGACAAeAACCGCAAXCGCGA' 



1930 



TXXGGGCGACAXCGTCAACAGCCCGATT 
1950 1970 

OTCOCaOTCGOCGGGTATTTGSCAACCSCCGCaAACOACSOGATSBTaCATATCTTCAAA 
199Q 2010 2030 

2 0 7 0 9 ft Q ft 

*'^^2??^^^^*^*''^*'^®^^^OAAXCCACCCXXGCCAAAGAGCXGCGCGCCXTT 
10 2130 2150 

GCCGAAAAAGGCIATGTGGGCGACCGCTACGGCGTaGACQGCGGCTTTGTCTTGCGCCAA 

2190 5$ift 

GTCGAACTOAGCGGGCAAAAACACGTGTITATGTTCGGCGCGATGGOTTTTOGCGGCAGG 

GGCGCGTATGCCTTGGATTTAAGCAAAATCAACGGAAATTA'rCCGGCCGCCGCCCCCCTG 

2310 233ft 

TTTGATGICAAAGATGGCGATAATAACGGCAAAAATC6C6TGAAAGTGGAAXTAGGCTAC 
ACCGTCGGTACGCCGCAAATCGGCAAAATCCGCAACGGCAJUITACSCCGC^ 
TCCGGTTATGCGGCTAAAAAAATTGACGACTCAACAAAIAAAACCOCGCM 
GATTTGAAAGACACCITAGGTACGCCGATTGCAAAAATCGAAGTGAAGGACGGCAAAGG 

OCCGGCG^CCGGGGCGGCAAtATGTACCGCTTTGATTTGAGCAATICCGATTCTAGTAAA 

TGGTCTGCAAAGGTTATTTTCGAAGGCGACAAGCCGATTACCTCCGCGCCCGCCGTTTCC 

CGACTGGaWACAAACGCGTCGXCATCWCGGTACGGGCAGCGAITTSACCGAAGATGAT 

GTACTOAATACGGGCGAACAATATATTOACGGTATCTTTGACGACGATAAGGGGACGGTT 

AAGGTAACGGTACAAAACGGCACGGCAGGCGGGCTGCTCGAGCAACACCT 

AATAAAACATTATTCCTGAACAAGAGATCCGACG^ 

AAXTTGAGOOAAOOAGAACGCGTTACCGXC^UVACCGACCGIGGTATXGCG^ 
0XAACCAXCCGCAAAXAXAACGAC6GCGGCXGCGGCGCGGAAACCGCCATTT 7GGGCATC 

aaxaccgccgacock:gg C gcaxxgacxccgag^ 
aattcggtxgcgcaatattccc^ccat^Sacaaccxcc^^ 
ggttgxatg^acaaagacggxaaaacwxcxgcccgaacggataxgtttacga 
gtiaaxgtgcgxxaxcxggaxgaaacggaaac^gacggaxxxtcaac^ CGGACGGC 
gatgcgggcggcagcggxaxa^ccccgccggcaggcgxccc^ 

TXCXCCAAAAJ^OGGGXGCGCACCCX^XGAXGAACGAXXTC^^^ 

ggcccgaxgxgcggtaxcaaacgcxtaa"xggcgcgaagxcxxcxx^^^ 
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GCGGCCGGXXTrxcCGCAAAXGCCGXCCGAAAGGCCXXCGGACGGCATTTTXTTGCGTTT 
TTCGGGAGGGGGGCGGCAAATGAAACG 
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10 30 50 

GATCCQCCCGOTGCTrSSaGCCCTTAGGGAACCGTTCCCTTTGAGCCGGGGCGGGGCAAC 
70 90 HO 

GCOTACCGGTTTTTOTTAATCCGCTAtAAAAGOCGGGCTATAGGGTAGGCTTCATCCTG 
130 150 170 

CCAATCTCACTOAATCCGTCAATTTCCGCAATTCAATIAAATACCGTCAAACCGATGCCG 
190 210 230 

TCATTCCaCGCAGGCGGGAATCCGGACCGGTCGGGCATCIGCGGCGOTTTGCTAAAAAAC 
250 270 290 

GCTTTACCGTGATAAGtGCGCAAAGTTAAAATGGGGAGGTAAGCTTTTCAATCAGCAATC 
310 330 350 

CGGCGGGCGCGGAATCGGGCGGTTTACCGAACCCCGGCGTTCGCGGCGCCaSTCCCGl 
370 390 Jin 



J7 ° 390 410 

AGGCAAACTTAAGGAATAAAATATGAATAAAACTTTGAAACGGCAGGX'fWCCGCCATAC 

MNKTLKROVPRHT 
430 450 470 

CGCGCTTTATGCC3CCATCTTGATGTTTTCCCATACCGGCGGGGGGGGGGGGGCGATGUC; 

Jon All<Mr *HTGGGGGAMA 
4,0 510 530 

G ^ CC ? T ^3 ACQCTATTATCAT< 5AACGAGCGAAACCAGCCCGAGGTAAAGCAGAA 
550 YAIXM .N_ BRNQPEVKQN 

s i 
610 
tGGAC 

T « r- i- t, -~.-».™.v»Ai<iu^^iiuti'fi;ttAftWUU«it5v 

670 SVSrN ^ NOTLVSQQSG 



sav 570 590 

TGTGCCATCTTCAATAAAGGACAAAGACAGGAGGCGCGAATATACTTATTATACGCACAG 
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SGYAAKKIDDS7NK7AL*VY 

2470 2490 2510 

7GA777GAAAGACACC77AGG7ACGCCGA77GCAAAAA7CGAAG7GAAGGACGGCAAAGG 
DLKD7LGTP 1AKIEVKDGKG 



FIGURE 4-2 



BNSOOOD: «WO_021 3871A1JU- 



WO 92/13871 PCT/US92/00863 

2SS0 2550 2170 

C03BCTTTCGTCCCCCACGCT08X60ATAAASAITTSaACS5CAC03XCGATATCGCCTA 
sLfi ' V D k D IjQGTVD I A ¥ 

2610 2£3ft 

TeCC30C0ACC3030C00CAATAT0TACCaCTTrSATTT6ASCAA»TCC6ATTCTAGTAA 

2670 2£Qrt 

A J°°*«GCAAAGGra^ 

271fl Vrr£SDKp IT6AE-AVS 

2730 27S0 

2770 KRVV IPGTQ6DI ( TEDD 

2790 9fiin 
T «ACTGAATACGGGCGAAC^^^^ 

2830 GCQYI ^0IFDDDK3TV 

2850 5fl*TA 
2 Lo V A 29 ? 0 • * * ■ Q H J T o E 

**^*k**t^^l'^'^?^ G ^^^^ g ^^^*^^^ a ^^^'*^®®®^ a ®^^^®®'^'^^'''»ggt 
2950 R 29 5 0 D e*S$KG W AV 

oaaattgaogsaagsagaacgcgttaccgtcaaaccgaccgtggtawSSg^^ 

iSlO T 30 ; 0 K * * V V L R T A F 

TTTT C I^J ATAACGACM ^^^ 

mo" ° 30 j 0 c s A E T i * > 6 1 

3 A 30 D T 3l | 0 ^ S A R P I V P D K 

CAAWC^TTOCG^ 

3»0 H 32 J 0 »*•«•;• t r i 

l*OTTGTAT03ACAAA3A^ 

3250 T 32 y 0 C ?NG5fVTD K P 

OOWAATOTOCGTTATCTGGATGAAAC^CAGACGGATITT^ 

3310 T = *£>GFSTTADG 



3330 33S0 

DAGOS ft -"""™ JCCGCCGGCAG6CGTCM ^ 

3370 D GRRPGKNNRC 



C0AT3C333C33CAGCB3TATA3ACCCCGI 



3430 ^ M N D «• * 3 X, D I T 

WGCCCGXTG^^ 



3490 



r r * 



CGC6GCCGGTTTTTCCGCAAATGCCGTCCGAAA3GCCTTCG3ACS3CATTTT? STGCGTT 
WTCOGGAGGGGGGCGGCAAATGAAACG 



BNSDOCID: <MR>_jeiS871A1JU> 



FIGURE 4-3 



WO 92/13871 M Y//5^ PCT/US92/00863 



10 30 50 

GAXCCGCCCGGXGCTTGGGGCCCXXAGGGAACCGXTCCCXXXGAGCCGGGGCGGGGCAAC 

70 90 110 

GCGXACCGGXXXTXGXXAAXCCGCXAXAAAAGGCGGGCTATAGGGXAGGCXTCATCCXG 

130 ISO 170 

CCAAXCXCACXGAAXCCGXCAAXXXCCGCAAXXCAAXXAAAXACCGXCAAACCGAXGCCG 

190 210 230 

TCAXXCCGCGCAGGCGGGAAXCCGGACCGGXCGGGCAXCXGCGGCGGXXXGCXAAAAAAC 

250 270 290 

GCXXXACCGXGAXAAGXGCGCAAAGXXAAAAXGGGGAGGTAAGCXTXTCAAXCAGCAAXC 

310 330 350 

CGGCGGGCGCGGAAXCGGGCGGXXXACCGAACCCCGGCGXXCGCGGCGCCCGXCCCGCGA 

370 390 410 

AGGCAAACXXAAGGAAXAAAAXATGAAXAAAACXXTGAAACGGCAGGTTTTCCGCCATAC 
£c*M 1 MHKTLKRQVTRHT 

430 450 470 

CGCGCXXXAXGCCGCCAXCXXGAXGXXTXCCCAXACCGGCGGGGGGGGGGGGCGAXGGCG 
ALVAAIZiMrSHTGGGGGRWR 

f *U» 2 M A 

490 510 530 

CAAACCCATCAATACGCTATTATCATGAACGAGCGAAACCAGCCCOAGGXAAAGCAGAAT 

KPINTLL6* 
QTHQYAI IMNBRNQPEVKQK 

550 570 590 

GTGCCATCTTCAATAAAGGACAAAGACAGGAGGCGCGAATATACTTATTATACGCACAGA 
V P 8 6 IKD KDRRREVTVYTHR 

"■0 630 650 
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GAXXXXXGGGCXAGGTGGGACAXXAAAGAXACCGGGCAGAXTCCGGXCAAGCXCGGCCXG 
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1 CCGC7G7A7G7G7A7GA777GGAAAACACCAG7GG7AGTC7GAT7AAAAAAATCGAAGCA 
CCCGGCGGCAAAGGCGGGCTT7CG7CCCCCACGCTGG7GGA7AAAGATTTGGACGGCACG 
G7CGA7A7CGCCTA7GCCGGCGACCGGGGCGGCAA7ATGTACCGC7T7GA77TGAGCAAT 
7CCGA77C7AG7AAA7GGTC7GCAAAGG77A7TT7CGAAGGCGACAAGCCGAT7ACC7CC 
GCGCCCGCCG777CCCGACTGGCAGACAAACGCG7GG77A7C77CGGCACGGGCAGCGAT 
7T0AG7GAACAGGA7G7ACTGGA7ACGGACAAACAA7A7A777ACGGXATC7T7GACGAC 
GA7AAG7CGACGG77AA7GTAAAGGTAACAAACGGCACGGGAGGCGGGC7GC7CGAGCAA 
GTGC77AAAGAGGAAAG7AAAACC7XA77CCXGAGCAA7AA7AAGGCATCCGGCGGA7CG 
GCCGA7AAAGGG7GGG7AG7GAAA77GAGGGAAGGAGAAGGCG77ACCG7CAAACCGACC 
G7GG7A77GCG7ACCGCC777G7CACCA7CCGCAAA7A7ACGGA7ACGGACAAA7G7GGC 
GCGCAAACCGCCA7777GGGCA7CAA7ACCGCCGACGGCGGCGCATTGACTCCGAGAAUC 
GCGCGCCCGA77G7GCCGGATCACAA77CGG77GCGCAA7AT7CCGGCCA7CAGAAAA7G 
AACGGCAAGXCC A7CCCGG 739 
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Technical F^fl 

The invention relates to materials and 
methodologies for managing the spread of infections 
caused by microorganisms having type 4 pilin, for 
example, Neisseria. More specifically, it relates to 
polypeptides and antibodies useful in vaccines for the 
treatment of pathologic infections caused by these 
microorganisms. It also relates to polynucleotides 
useful for the recombinant production of these 
polypeptides. In addition, it relates to polypeptides, 
antibodies, and polynucleotides used for the detection of 
these strains. 

Background ^ r t 

Type 4 pilins are expressed by several 
bacterial genuses, including Neisseria, Moraxella, 
Bacteroides, and Pseudomonas . Species within these 
genuses which have pathogenic members that express type 4 
pilins are, for example, N. gonorrhoeae, N. meningitidis, 
M. bovis, B. nodosus, and P. aeruginosa. In addition, 
the Tcp pilin of V. cholerae is highly homologous to the 
type 4 pilins of other genuses. 
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The only known reservoir of the neisseriae is 
man. The genus includes two gram- negative species of 
pyogenic cocci that are pathogenic for man: the 
meningococcus (Neisseria meningitidis) and the gonococcus 
5 (Neisseria gonorrheae) . 

N. Meningitidis causes a variety of infections, 
most notably, meningitis and bacteremia. Meningococci 
can be divided into serologic groups on the basis of 
agglutination reactions with immune serum. The present 
10 classification includes groups A through Z. Clinically 
significant new groups encompass Y and W 135. The major 
groups are remarkably heterogeneous, but subclassi- 
fication with additional serologic markers has been 
possible. Noncapsular antigens have provided the basis 
15 for dividing strains of groups into distinct types. 

Meningococci cause either epidemic or sporadic 
disease, and historically, there has been a cyclic 
variation in the prevalence of meningococcal infection 
with peaks of increased frequency occurring every 8 to 12 
years and lasting 4 to 6 years. The attack rate of 
meningococcal disease is highest for children between 6 
months and l year. In the first half of this century, 
most epidemics of meningococcal disease in the United 
States were caused by group A organisms. In the past two 
decades, first group B then group C meningococci were 
responsible for outbreaks in both the military and 
civilian populations. Currently, group B is responsible 
for 50 to 55 percent of reported cases. 

Gonorrhea, which is caused by N. gonorrhea, is 
an infection of columnar and transitional epithelium. 
This disease is the most common reportable communicable 
disease in the United States, and also has world-wide 
prevalence. 
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Although treatment of disease caused by 
gonococci and meningococci are often treated with 
antibiotics, these microorganisms often develop 
antibiotic resistance. Thus, prevention with vaccines is 
a preferable mode to contain the spread of infection. 
However, for a variety of reasons, including antigenic 
variation, the development of vaccines has been greatly 
hampered. For example, a vaccine which prevents 
gonorrhea is still lacking. In addition, although 56% of 
the causes of meningococcal disease are caused by 
serogroup B, an effective vaccine against this serogroup 
is also lacking. 

If. gonorrheae and N. meningitidis are organisms 
completely adapted to the human host, having no other 
15 ecological niche. They have acquired a large arsenal of 
strategies to overcome the human host defense system. 

The first step in infection with pathological 
forms of these Neisseria is adherence to target cells. 
It is thought that the pili of these microorganisms are a 
20 major virulence factor. For example, it is known that in 
the case of N. gonorrheae, piliated (P + ) variants attach 
much better to susceptible cells than non-piliated (P-) 
variants (Swanson, 1973; Pear ce and Buchanan, 1978). 
Moreover, P+ variants, unlike P- variants, are able to 
25 establish an infection in human volunteers (Kellog et al, 
1968) . 

Although the pilus protein elicits an immune 
response, so many antigenic variants exist and continue 
to develop that vaccines against the pilus protein are 
30 not highly effective. 

Pilin is the major subunit of the pilus. 
Expression of pilin is controlled at the pilE locus. 
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9rief Description of i-fre Invsnting 

We have isolated and characterized a novel 
protein of pathogenic forms of Neisseria, Pile, that may 
be is associated with the pili of gonococci and 
5 meningococci. We have also isolated and characterized 
genes which encode Pile, i.e., the pile loci. 

Portions of the DNA sequences of the pile genes 
are useful as probes to diagnose the presence of the 
relevant Neisseria in samples. These DNAs also make 
10 available polypeptide sequences of immunoreactive 

epitopes encoded within the loci, thus permitting the 
production of polypeptides which are useful as standards 
or reagents in diagnostic tests and/or as components of 
vaccines for microorganisms with type 4 pilin and 
containing one or more epitopes that are immunologically 
identifiable with an epitope encoded in pile of 
Neisseria. Antibodies, both monoclonal and purified 
polyclonal, directed against Pile epitopes are also 
useful for diagnostic tests and as therapeutic agents for 
20 passive immunization, in addition, by utilizing probes 

derived from the DNA sequences, it is possible to isolate 
and sequence portions of the pile loci from species and 
strains of interest. 

Accordingly, one embodiment of the invention is 
a recombinant polynucleotide encoding a polypeptide 
comprised of an immunoreactive epitope of a protein 
encoded in pile of Neisseria. 

Another embodiment of the invention is a 
recombinant expression system comprising a polynucleotide 
encoding a polypeptide comprised of an immunoreactive 
epitope of a protein encoded in pile of Neisseria, 
wherein the polynucleotide is operably linked to a 
control sequence compatible with a desired host. 
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Yet another embodiment of the invention is 
purified polypeptide comprised of an immunoreactive 
epitope of a protein encoded in pile of Neisseria. 

Another embodiment of the invention is a 
recombinant polypeptide comprised of an immunoreactive 
epitope of a protein encoded in pile of Neisseria. 

Still another embodiment of the invention is a 
vaccine composition for the treatment of Neisseria 
infection, comprised of a pharmaceutical ly acceptable 
excipient and of an effective amount of a recombinant 
polypeptide, wherein the polypeptide is comprised of an 
immunoreactive epitope of a protein encoded in pile of 
Neisseria. 

Yet another embodiment of the invention is a 
composition comprised of purified polyclonal anti-PilC 
antibodies, wherein the PilC is of Neisseria. 

An additional embodiment of the invention is a 
composition comprised of a monoclonal antibody directed 
against an immunoreactive epitope encoded in pile of 
Neisseria. 

Another embodiment of the invention is a method 
for producing antibodies to PilC of Neisseria comprising 
administering to an individual a composition comprised of 
an isolated immunogenic polypeptide containing a PilC 
epitope in an amount sufficient to produce an immune 
response . 

Yet another embodiment of the invention is an 
oligomer capable of hybridizing to a sequence in pile of 
Neisseria, wherein the oligomer is comprised of a pile 
sequence complementary to at least about 6 contiguous 
nucleotides of pile. 

Still another embodiment of the invention is a 
process for detecting a pile sequence in an analyte 
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strand. wherein the pile sequence comprises a selected 
target region, the process comprising: 

(a) providing a sample comprised of an analyte 
strand suspected of containing a selected target pile 

5 sequence; 

(b) providing an oligomer capable of 
hybridizing to the target pile sequence, wherein the 
oligomer is comprised of a pile targeting sequence 
complementary to at least about 6 contiguous nucleotides 

10 of pilC; 

(c) incubating the sample of (a) with the 
oligomer of (b) under conditions which allow specific 
hybrid duplexes to form between the targeting sequence 
and the target sequence; and 

15 (d > detecting hybrids formed between the 

target sequence, if any, and the oligomer. 

Yet another embodiment of the invention is a 
recombinant polynucleotide comprising a DNA sequence of 
at least 8 contiguous nucleotides from pile, wherein the 
pile sequence is selected from the group of sequences 
shown in Figure 3, Figure 6, and Figure 7. 

Another embodiment of the invention is a method 
of treating an individual for a Neisseria infection 
comprising administering to the individual antibodies 
produced according to claim 31, wherein the antibodies 
are administered in an amount effective to prevent the 
pathology of the infection. 
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Prjef Rescript. ion of th» nr a «^ m f ! 

Figure l is a genetic and physical map of pile 
locus l, showing the restriction enzyme sites. 

Figure 2 shows the nucleotide sequence and the 
deduced amino acid sequence of the 5' -end of pilCl. 



35 



BN8DOCU): xWO_82iae71A1_lA> 



W ° 92/13871 PCI7US92/00863 



-7- 

Figure 3 shows the nucleotide sequence of the 
sense strand of the pilCl gene. 

Figure 4 shows the shows the nucleotide 
sequence of the sense strand of the pilCl gene and the 
5 amino acids encoded therein. 

Figure 5 shows the nucleotide sequence of the 
sense strand of the pilCl gene, and the effect of frame 
shift on the putative gene products encoded therein. 

Figure 6 shows the nucleotide sequences of PCR 
10 amplified fragments demonstrating a variation in the 

length of the G tract and sequence differences in the 5' 
region of the pile genes. 

Figure 7 shows the DNA sequence of the 3' -end 
of the pilC2 fragment. 
15 Figure 8 shows the pilC2 fragment sequence, and 

the putative amino acids encoded therein. 

Figure 9 shows a comparison of the analogous 
portions of pilC2 (top) and pilCl (bottom) DNA sequences, 
and the putative amino acids encoded therein. 

20 

Modes fpr Carrying Our the invent- < ffn 

The present invention provides polypeptides, 
antibodies, and polynucleotides which are useful for the 
detection and treatment of pathogenic microorganisms 

25 having type 4 pilin, for example. Neisseria, Moraxella, 
Bacteroides , and Pseudojnonas . 

We have discovered a polypeptide, Pile, which 
is present in N. gonorrheae. This polypeptide is a no 
led protein that is closely associated with the pili of 

30 the microorganism. Most strains of N. gonorrheae carry 
two copies of the corresponding genes which encode the 
polypeptide (s) ; these genes have been denoted pile. 
Expression from the pile loci is regulated by frequent 
frameshift mutations within a run of G residues in the 
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region encoding the signal peptide. The two pile genes 
of N. gonorrheae are not identical. Hence, alternate 
expression from either the pilCl or the pilC2 loci gives 
rise to two different forms of Pile. Among nonpiliated 
(P ) descendants from P + clones, clones were found that 
expressed pilin but not Pile. All p + revertants from 
such Pile" non-piliated clones have regained expression 
of Pile. Hence, phase variation of gonococcal pili can 
be caused by frameshift mutations in pile. Transposon 
inactivation of the expressed pilC2 copy resulted in a 
nonpiliated, pilin producing revertible phenotype. It 
appears, therefore, that Pile is required for assembly of 
pilin subunits into a polymerized pilus fiber in N. 
gonorrheae . 

We have cloned and isolated gene, pilCl, from 
N. gonorrheae. In addition, by comparison of this gene 
sequence with a related sequence, we have cloned a 
fragment of the pil2 gene. Moreover, using 
polynucleotide probes derived from isolated pilCl and PCR 
amplification, we have detected two possible variants of 
a pile gene in N. meningitidis. The sequences of pile 
reported herein appear to be novel, in that there are no 
reported counterparts in Genbank, and no significant 
homologies were found with any available sequences in 
25 that data base. 

The useful materials and processes of the 
present invention are made possible by the provision of 
the sequences of the pile genes from nr. ^onorrneae and 
from N. meningitidis. Information present in the 

30 sequences of the pile genes allows for the design of 
polypeptides which may be useful as vaccines for 
treatment of pathogenic Neisseria, as diagnostic tools 
for the detection of these microorganisms, and as agents 

35 for the preparation of antibodies to Pile. In addition, 
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this information allows for the design of polynucleotides 
for the recombinant production of the polypeptides 
derived from Pile, and for the design of oligomers which 
are useful as probes and primers for the detection and 
5 amplification of target regions of pile. 

The practice of the present invention will 
employ, unless otherwise indicated, conventional 
techniques of molecular biology, microbiology, 
recombinant DNA, and immunology, which are within the 
10 skill of the art. Such techniques are explained fully in 
the literature. See e.g., Maniatis, Fitsch & Sambrook, 
MOLECULAR CLONING; A LABORATORY MANUAL, Second Edition 
(1989); DNA CLONING, VOLUMES I AND II (D.N Glover ed. 
1985); OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed, 1984); 
15 NUCLEIC ACID HYBRIDIZATION (B.D. Haines & S.J. Higgins 

eds. 1984); TRANSCRIPTION AND TRANSLATION (B.D. Hames & 
S.J. Higgins eds. 1984); ANIMAL CELL CULTURE (R.I. 
Freshney ed. 1986); IMMOBILIZED CELLS AND ENZYMES (IRL 
Press, 1986); B. Perbal, A PRACTICAL GUIDE TO MOLECULAR 
20 CLONING (1984) ; the series, METHODS IN ENZYMOLOGY 
(Academic Press, Inc.); GENE TRANSFER VECTORS FOR 
MAMMALIAN CELLS (J.H. Miller and M.P. Calos eds. 1987, 
Cold Spring Harbor Laboratory) , Methods in Enzymology 
Vol. 154 and Vol. 155 (Wu and Grossman, and Wu, eds., 
25 respectively), Mayer and Walker, eds. (1987), 

IMMUNOCHEMICAL METHODS IN CELL AND MOLECULAR BIOLOGY 
(Academic Press, London), Scopes, (1987), PROTEIN 
PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition 
( Springer -Verlag, N.Y.), and HANDBOOK OF EXPERIMENTAL 
30 IMMUNOLOGY, VOLUMES I -IV (D.M. Weir and C. C. Blackwell 
eds 1986) . All patents, patent applications, and 
publications mentioned herein, both supra and infra, are 
hereby incorporated herein by reference. 
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As used herein, a polynucleotide "derived from" 
a designated sequence refers to a polynucleotide sequence 
which is comprised of a sequence of approximately at 
least about 6 nucleotides, preferably at least about 8 
5 nucleotides, more preferably at least about 10-12 

nucleotides, and even more preferably at least about 
15-20 nucleotides corresponding to a region of the 
designated nucleotide sequence. "Corresponding- means 
homologous to or complementary to the designated 
10 sequence. Regions from which typical polynucleotide 

sequences may be "derived" include but are not limited 
to, for example, regions encoding specific epitopes, as 
well as non- transcribed and/or non- translated regions. 

The derived polynucleotide is not necessarily 
physically derived from the nucleotide sequence shown, 
but may be generated in any manner, including for 
example, chemical synthesis or DNA replication or reverse 
transcription or transcription. in addition, 
combinations of regions corresponding to that of the 
designated sequence may be modified in ways known in the 
art to be consistent with an intended use. 

Similarly, a polypeptide or amino acid sequence 
"derived from" a designated nucleic acid sequence refers 
to a polypeptide having an amino acid sequence identical 
to that of a polypeptide encoded in the sequence, or a 
portion thereof wherein the portion consists of at least 
3-5 amino acids, and more preferably at least 8-10 amino 
acids, and even more preferably at least 11-15 amino 
acids, or which is immunologically identifiable with a 
polypeptide encoded in the sequence. This terminology 
also includes a polypeptide expressed from a designated 
nucleic acid sequence. 

A recombinant or derived polypeptide is not 
necessarily translated from a designated nucleic acid 
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sequence. It may be generated in any manner, including 
for example, chemical synthesis, or expression of a 
recombinant expression system, or isolation from a 
microorganism. A recombinant or derived polypeptide may 
include one or more analogs of amino acids or unnatural 
amino acids in its sequence. Methods of inserting 
analogs of amino acids into a sequence are known in the 
art. It also may include one or more labels, which are 
known to those of skill in the art. 

The term "recombinant polynucleotide" as used 
herein intends a polynucleotide of genomic, cDNA, 
semisynthetic, or synthetic origin which, by virtue of 
its origin or manipulation: (1) is not associated with 
all or a portion of a polynucleotide with which it is 
15 associated in nature, (2) is linked to a polynucleotide 

other than that to which it is linked in nature, (3) does 
not occur in nature, or (4) is not in the form of a 
library. 

The term "polynucleotide" as used herein refers 
to a polymeric form of nucleotides of any length, either 
ribonucleotides or deoxyribonucleotides . This term 
refers only to the primary structure of the molecule. 
Thus, this term includes double- and single -stranded DNA 
and RNA. It also includes known types of modifications, 
25 for example, labels which are known in the art, 

methylation, "caps", substitution of one or more of the 
naturally occurring nucleotides with an analog, 
internucleotide modifications such as, for example, those 
with uncharged linkages (e.g., methyl phosphonates, 
phosphotriesters, phosphoamidates , carbamates, etc.) and 
with charged linkages (e.g., phosphorothioates, 
phosphorodithioates, etc.), those containing pendant 
moieties, such as, for example proteins (including for 
e.g., nucleases, toxins, antibodies, signal peptides. 
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poly-L-lysine, etc.), those with intercalators (e.g., 
acridine, psoralen, etc.), those containing chelators 
(e.g., metals, radioactive metals, boron, oxidative 
metals, etc.), those containing alkylators, those with 
modified linkages (e.g., alpha anomeric nucleic acids, 
etc.), as well as unmodified forms of the polynucleotide. 

The term "purified polynucleotide" refers to a 
polynucleotide which is essentially free, i.e., contains 
less than about 50%, preferably less than about 70%, and 
even more preferably less than about 90% of polypeptides 
with which the polynucleotide is naturally associated. 
Techniques for purifying polynucleotides from bacteria 
are known in the art, and include for example, disruption 
of the bacteria with a chaotropic agent, differential 
15 extraction and separation of the polynucleotide (s) and 
polypeptides by ion-exchange chromatography, affinity 
chromatography, and sedimentation according to density. 

The term "purified polypeptide" refers to a 
polypeptide or fragment thereof which is essentially 
20 free, i.e., contains less than about 50%, preferably less 
than about 70%, and even more preferably less than about 
90%, of cellular components with which the polypeptide is 
naturally associated. Techniques for purifying 
polypeptides are known in the art, and examples of these 
25 techniques are discussed infra. 

"Recombinant host cells", "host cells", 
"cells", "cell lines", "cell cultures", and other such 
terms denoting microorganisms or higher eukaryotic cell 
lines cultured as unicellular entities refer to cells 
which can be, or have been, used as recipients for 
recombinant vector or other transfer DNA, and include the 
progeny of the original cell which has been transfected. 
It is understood that the progeny of a single parental 
cell may not necessarily be completely identical in 
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morphology or in genomic or total DNA complement as the 
original parent, due to natural, accidental, or 
deliberate mutation. 

A "replicon" is any genetic element, e.g., a 
plasmid, a chromosome, a virus, a cosmid, etc. that 
behaves as an autonomous unit of polynucleotide replica- 
tion within a cell; i.e., capable of replication under 
its own control . 

A "vector" is a replicon in which another 
polynucleotide segment is attached, so as to bring about 
the replication and/or expression of the attached 
segment . 

"Control sequence" refers to polynucleotide 
sequences which are necessary to effect the expression of 
coding sequences to which they are ligated. The nature 
of such control sequences differs depending upon the host 
organism; in prokaryotes, such control sequences 
generally include promoter, ribosomal binding site, and 
terminators; in eukaryotes, generally, such control 
sequences include promoters, terminators and, in some 
instances, enhancers • The term "control sequences" is 
intended to include, at a minimum, all components whose 
presence is necessary for expression, and may also 
include additional components whose presence is 
advantageous, for example, leader sequences. 

"Operably linked" refers to a juxtaposition 
wherein the components so described are in a relationship 
permitting them to function in their intended manner. A 
control sequence "operably linked" to a coding sequence 
is ligated in such a way that expression of the coding 
sequence is achieved under conditions compatible with the 
control sequences . 

An "open reading frame" (ORF) is a region of a 
polynucleotide sequence which encodes a polypeptide; this 
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region may represent a portion of a coding sequence or a 
total coding sequence. 

A "coding sequence" is a polynucleotide 
sequence which is transcribed into mRNA and/or translated 
into a polypeptide when placed under the control of 
appropriate regulatory sequences. The boundaries of the 
coding sequence are determined by a translation start 
codon at the 5' -terminus and a translation stop codon at 
the 3' -terminus. A coding sequence can include, but is 
not limited to mRNA, cDNA, and recombinant polynucleotide 
sequences . 

"Immunologically identifiable with/as" refers 
to the presence of epitope (s) and polypeptides (s) which 
are also present in the designated polypeptide (s) . 
Immunological identity may be determined by antibody 
binding and/or competition in binding; these techniques 
are known to those of average skill in the art, and are 
also illustrated infra. 

As used herein, "epitope" refers to an 
antigenic determinant of a polypeptide. An epitope could 
comprise 3 amino acids in a spatial conformation which is 
unique to the epitope. Generally an epitope consists of 
at least 5 such amino acids, and more usually, consists 
of at least 8-10 such amino acids. Methods of 
determining the spatial conformation of amino acids are 
known in the art, and include, for example, x-ray 
crystallography and 2 -dimensional nuclear magnetic 
resonance . 

A polypeptide is "immunoreactive" when it is 
30 "immunologically reactive" with an antibody, i.e., when 
it binds to an antibody due to antibody recognition of a 
specific epitope contained within the polypeptide. 
Immunological reactivity may be determined by antibody 
^ binding, more particularly by the kinetics of antibody 
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binding, and/or by competition in binding using as 
competitor (s) a known polypeptide (s) containing an 
epitope against which the antibody is directed. The 
techniques for determining whether a polypeptide is 
immunologically reactive with an antibody are known in 
the art. An "immunoreactive" polypeptide may also be 
-immunogenic''. As used herein, the term "immunogenic 
polypeptide" is a polypeptide that elicits a cellular 
and/or humoral immune response, whether alone or linked 
to a carrier in the presence or absence of an adjuvant. 

As used herein, the term "antibody" refers to a 
polypeptide or group of polypeptides which are comprised 
of at least one antibody combining site. An "antibody 
combining site" or "binding domain" is formed from the 
15 folding of variable domains of an antibody molecule (s) to 
form three-dimensional binding spaces with an internal 
surface shape and charge distribution complementary to 
the features of an epitope of an antigen, which allows an 
immunological reaction with the antigen. An antibody 
20 combining site may be formed from a heavy and/or a light 
chain domain (VH and VL, respectively) , which form 
hypervariable loops which contribute to antigen binding. 
The term "antibody" includes, for example, vertebrate 
antibodies, hybrid antibodies, chimeric antibodies, 
25 altered antibodies, univalent antibodies, the Fab 
proteins, and single domain antibodies. 

As used herein, a "single domain antibody" 
(dAb) is an antibody which is comprised of an VH domain, 
which reacts immunologically with a designated antigen. 
A dAB does not contain a VL domain, but may contain other 
antigen binding domains known to exist in antibodies, for 
example, the kappa and lambda domains. Methods for 
preparing dABs are known in the art. See, for example, 
Ward et al . (1989) . 
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Antibodies may also be comprised of VH and vl 
domains, as well as other known antigen binding domains. 
Examples of these types of antibodies and methods for 
their preparation are known in the art (see, e.g., u.S 
Patent No. 4,816,467, which is incorporated herein by 
reference), and include the following. For example, 
"vertebrate antibodies" refers to antibodies which are 
tetramers or aggregates thereof, comprising light and 
heavy chains which are usually aggregated in a "Y" 
configuration and which may or may not have covalent 
linkages between the chains, m vertebrate antibodies, 
the amino acid sequences of all the chains of a 
particular antibody are homologous with the chains found 
in one antibody produced by the lymphocyte which produces 
that antibody in situ, or in vitro (for example, in 
hybridomas) . Vertebrate antibodies typically include 
native antibodies, for example, purified polyclonal 
antibodies and monoclonal antibodies. Examples of the 
methods for the preparation of these antibodies are 
20 described infra. 

"Hybrid antibodies" are antibodies wherein one 
paxr of heavy and light chains is homologous to those in 
a fxrst antibody, while the other pair of heavy and light 
chains xs homologous to those in a different second anti- 

25 body. Typically, each of these two pairs will bind 

different epitopes, particularly on different antigens. 
Thxs results in the property of "divalence", i.e., the 
abxlxty to bind two antigens simultaneously. Such 
hybrids may also be formed using chimeric chains, as set 

3 0 forth below. 

"Chimeric antibodies", are antibodies in which 
the heavy and/or light chains are fusion proteins. 
Typxcally the constant domain of the chains is from one 
3s particular species and/or class, and the variable domains 
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are from a different species and/or class. Also included 
is any antibody in which either or both of the heavy or 
light chains are composed of combinations of sequences 
mimicking the sequences in antibodies of different 
5 sources, whether these sources be differing classes, or 
different species of origin, and whether or not the 
fusion point is at the variable/constant boundary. Thus, 
it is possible to produce antibodies in which neither the 
constant nor the variable region mimic known antibody 
10 sequences. It then becomes possible, for example, to 

construct antibodies whose variable region has a higher 
specific affinity for a particular antigen, or whose 
constant region can elicit enhanced complement fixation, 
or to make other improvements in properties possessed by 
15 a particular constant region. 

Another example is "altered antibodies", which 
refers to antibodies in which the naturally occurring 
amino acid sequence in a vertebrate antibody has been 
varied. Utilizing recombinant DNA techniques, antibodies 
can be redesigned to obtain desired characteristics. The 
possible variations are many, and range from the changing 
of one or more amino acids to the complete redesign of a 
region, for example, the constant region. Changes in the 
constant region, in general, to attain desired cellular 
25 process characteristics, e.g., changes in complement 

fixation, interaction with membranes, and other effector 
functions. Changes in the variable region may be made to 
alter antigen binding characteristics. The antibody may 
also be engineered to aid the specific delivery of a 
molecule or substance to a specific cell or tissue site. 
The desired alterations may be made by known techniques 
in molecular biology, e.g., recombinant techniques, site 
directed mutagenesis, etc. 

35 
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Yet another example are "univalent antibodies", 
which are aggregates comprised of a heavy chain/light 
chain dimer bound to the Fc (i.e., constant) region of a 
second heavy chain. This type of antibody escapes 
antigenic modulation. See, e.g., Glennie et al. (1982). 

Included also within the definition of antibod- 
ies are "Fab- fragments of antibodies. The "Fab" region 
refers to those portions of the heavy and light chains 
which are roughly equivalent, or analogous, to the 
sequences which comprise the branch portion of the heavy 
and light chains, and which have been shown to exhibit 
immunological binding to a specified antigen, but which 
lack the effector Fc portion . "Fab" includes aggregates 
of one heavy and one light chain (commonly known as 
Fab'), as well as tetramers containing the 2H and 2L 
chains (referred to as F(ab> 2 ), which are capable of 
selectively reacting with a designated antigen or antigen 
family. "Fab- antibodies may be divided into subsets 
analogous to those described above, i.e, "vertebrate 
20 Fab", "hybrid Fab", "chimeric Fab", and "altered Fab". 
Methods of producing "Fab" fragments of antibodies are 
known within the art and include, for example, 
proteolysis, and synthesis by recombinant techniques. 

The term "polypeptide" refers to a polymer of 
amino acids and does not refer to a specific length of 
the product; thus, peptides, oligopeptides, and proteins 
are included within the definition of polypeptide. This 
term also does not refer to or exclude post -expression 
modifications of the polypeptide, for example, 
glycosylations, acetylations, phosphorylations and the 
like. included within the definition are, for example, 
polypeptides containing one or more analogs of an amino 
acid (including, for example, unnatural amino acids, 
etc.), polypeptides with substituted linkages, as well as 
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other modifications known in the art, both naturally 
occurring and non-naturally occurring. 

"Transformation" , as used herein, refers to the 
insertion of an exogenous polynucleotide into a host 
5 cell, irrespective of the method used for the insertion, 
for example, direct uptake, transduction, f -mating or 
electroporation. The exogenous polynucleotide may be 
maintained as a non- integrated vector, for example, a 
plasmid, or alternatively, may be integrated into the 
10 host genome. 

"Treatment" as used herein refers to 
prophylaxis and/or therapy. 

By "immunogenic" is meant an agent used to 
stimulate the immune system of a living organism, so that 
15 one or more functions of the immune system are increased 
and directed towards the immunogenic agent. Immunogenic 
agents include vaccines. Immunogenic agents can be used 
in the production of antibodies, both isolated polyclonal 
antibodies and monoclonal antibodies, using techniques 
20 known in the art. 

By vaccine is meant an agent used to stimulate 
the immune system of a living organism so that protection 
against or amelioration of future harm is provided. 
Immunization refers to the process of inducing an 

25 increased level of antibodies and/or cellular immune 

response in which T- lymphocytes respond by killing the 
pathogen and/or activate other cells involved in the 
immune response pathway. The antibodies produced as a 
result of immunization may belong to any of the 

30 immunological classes, such as immunoglobulins A, D, E, G 
or M. 

An "individual", as used herein, refers to 
vertebrates, particularly members of the mammalian 
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species, and includes but is not limited to domestic 
animals, sports animals, and primates, including humans. 

As used herein, the "sense strand" of a nucleic 
acid contains the sequence that has sequence homology to 
that of mRNA. The -anti -sense strand" contains a 
sequence which is complementary to that of the "sense 
strand" . 

As used herein, the term "probe" refers to a 
polynucleotide which forms a hybrid structure with a 
sequence in a target region, due to complementarity of at 
least one sequence in the probe with a sequence in the 
target region. The polynucleotide regions of probes may 
be composed of DNA, and/or RNA, and/or synthetic 
nucleotide analogs. 

As used herein, the term "target region" refers 
to a region of the nucleic acid which is to be amplified 
and/or detected. The term "target sequence refers to a 
sequence with which a probe or primer will form a stable 
hybrid under desired conditions. 
20 The term "primer" as used herein refers to an 

oligomer which is capable of acting as a point of 
initiation of synthesis of a polynucleotide strand when 
placed under appropriate conditions. The primer will be 
completely or substantially complementary to a region of 
25 the polynucleotide strand to be copied. Thus, under 

conditions conducive to hybridization, the primer will 
anneal to the complementary region of the analyte strand. 
Upon addition of suitable reactants, (e.g., a polymerase, 
nucleotide triphosphates, and the like), the primer is 
extended by the polymerizing agent to form a copy of the 
analyte strand. The primer may be single- stranded, or 
alternatively may be partially or fully double- stranded. 

The terms "analyte polynucleotide" and "analyte 
3s strand" refer to a single- or double- stranded nucleic 
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acid molecule which is suspected of containing a target 
sequence, and which may be present in a biological 
sample. 

As used herein, a "biological sample" refers to 
a sample of tissue or fluid isolated from an individual, 
including but not limited to, for example, plasma, serum, 
spinal fluid, lymph fluid, the external sections of the 
skin, respiratory, intestinal, and genitourinary tracts, 
tears, saliva, milk, blood cells, tumors, organs, and 
also samples of in vitro cell culture constituents 
(including but not limited to conditioned medium 
resulting from the growth of cells in cell culture 
medium, putatively virally infected cells, recombinant 
cells, and cell components) . 

As used herein, the term "oligomer" refers to 
primers and to probes. The term oligomer does not 
connote the size of the molecule. However, typically 
oligomers are no greater than 1000 nucleotides, more 
typically are no greater than 500 nucleotides, even more 
typically are no greater than 250 nucleotides; they may 
be no greater than 100 nucleotides, and may be no greater 
than 75 nucleotides, and also may be no greater than 50 
nucleotides in length. 

The term "coupled" as used herein refers to 
attachment by covalent bonds or by strong non- covalent 
interactions (e.g., hydrophobic interactions , hydrogen 
bonds, etc.). Covalent bonds may be, for example, ester, 
ether, phosphoester , amide, peptide, imide, carbon-sulfur 
bonds, carbon -phosphorus bonds, and the like. 

The term "support" refers to any solid or semi- 
solid surface to which a desired polypeptide or 
polynucleotide may be anchored. Suitable supports 
include glass, plastic, metal, polymer gels, and the 
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like, and may take the form of beads, wells, dipsticks, 
membranes, and the like. 

The term "label" as used herein refers to any 
atom or moiety which can be used to provide a detectable 
(preferably quantifiable) signal, and which can be 
attached to a polynucleotide or polypeptide. 

The term "type 4 pilin" as used herein refer to 
pilins that contain a conserved amino terminal 
hydrophobic domain beginning with an amino- terminal 
phenylalanine that is methylated upon processing and 
secretion of the pilin. Another characteristic feature 
of type 4 pilins is that in the propilin form they 
contain similar six- or seven-amino acid long leader 
peptides, which are much shorter than typical signal 
15 sequences. Type 4 pilins are expressed by several 
bacterial genuses, including Neisseria, Moraxella, 
Bacteroides, and Pseudomonas. Species within these 
genuses which express type 4 pilins are, for example, N. 
gonorrhoeae, N. meningitidis, M. bovis, B. nodosus, and 
20 p. aeruginosa. As used herein, the term "type 4 pilin" 
also includes the Tcp pilin of ViJbrio, (for example, v. 
cholerae) , that is highly homologous to the type 4 pilins 
of other genuses. Tcp pilin contains the characteristic 
amino- terminal hydrophobic domain as well as having a 
25 modified N- terminal amino acid that in this case may be a 
modified methionine because the Tcp pilin gene encodes a 
methionine residue at the position where all the others 
encode a phenylalanine. Precursor TcpA contains a much 
longer leader sequence than typical type 4 propilins but 
retains homology in the region surrounding the processing 
site. 

The term "pile as used herein refers to a gene 
encoding a polypeptide involved in the assembly of type 4 
pilin, which may also be required for attachment of the 
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pilin, and that is comprised of an epitope that is 
immunologically identif iable with an epitope in Pile of 
N. gonorrhae or N. meningitidis. Included within this 
term is any homologous region from Vibrio, tcpC. 

As used herein the term "PilC" refers to a 
polypeptide encoded within pile, and includes TcpC of 
Vibrio. 

The description of the method to retrieve the 
DNA sequences is mostly of historical interest. The 
resultant sequences (and their complements) are provided 
herein, and the sequences, or any portion thereof, could 
be prepared using synthetic methods, or by a combination 
of synthetic methods with retrieval of partial sequences 
using methods similar to those described herein . 

The description infra, of "walking" the genome 
by isolating overlapping DNA sequences from the N. 
gonorrheal lambda gt-11 library and from an EMBL3 library 
provides one method by which DNAs corresponding to the 
pile genomes from, inter alia, N. gonorrheae and N. 
meningitidis, respectively, may be isolated. However, 
given the information provided herein, other methods for 
isolating pile DNAs from these species, as well as from 
species of other genuses which have type 4 pilin are 
obvious to one of skill in the art. 

Characterization of the genes of the pile loci 
has provided information on the polypeptides encoded 
therein, and on the control of their expression. 
Even though Type 4 pili have been extensively studied in 
several laboratories, little is known about their 
assembly. The presence of a specific assembly machinery 
for this class of pili is evident from the fact that the 
pilin gene of B. nodosus and Af. Jbovis can be properly 
processed and assembled into a pilus in P. aeruginosa but 
not in E. coli (Ellerman et al., 1986; Mattick et al., 
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1987; Beard et al., 1990) . Furthermore, the recent 
genetic characterization of TCP pili of Vibrio cholerae 
has revealed that a number of closely linked genes are 
required for pilin processing and assembly into a 
5 structure (Taylor et al., 1988). The TCP pilin does not 
carry an N-methylphenylalanine but its primary sequence 
is highly homologous to the Type 4 class of pilins. 

The N. gonorrhoeae pilus facilitates adherence 
of the bacterium to a number of eukaryotic cell types 
10 (watt et al., i960) and is thought to play a role in 

bacterial interaction with neutrophils (Fischer and Rest, 
1988) . The pilin is encoded from one or two pilE loci 
(Meyer et al., 1984; Swanson et al., 1986) which most 
likely each form a monocistronic operon. Hence, there 
have been no suggestions that genes closely linked to 
pilB are involved in pilus assembly. A dispersed 
location of genes involved in gonococcal pilus assembly 
as well as the rapid occurrence of nonpiliated variants 
generated via recombination with pilin sequences from 
silent loci, piis, have made it extremely difficult to 
identify putative assembly genes for gonococcal pili. 

The Pile protein described herein is a protein 
encoded within a pile or equivalent (for example, tcpC) 
locus or gene. In N. gonorrhoeae MSH and most other 
gonococcal strains the Pile protein is expressed in small 
amounts. it is the only protein that is enriched in 
highly purified preparations of MS11 pili. Pile was not 
released from a nonpilated MSll (p- n ) variant using the 
same procedure suggesting that this protein interacts 
with the polymerized pilus fiber. 

DNA sequence analysis of the cloned pilci gene 
revealed one long open reading frame that was out of 
frame with its putative AUG initiation codon and 5' end 
encoding the signal peptide. Minute amounts of Pile were 
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expressed in E. coli from pABJ04 . Gel purified PilC from 
MS11 contained a lysine residue in position four, whereas 
pilCl had a glutamine codon at this position . A lysine 
codon was, however, found at position four in a number of 
5 PCR amplified 5' pile fragments suggesting that these 

fragments represent the 5' end of pilC2, which then must 
be ON in MSll. The finding that a miniTnCm insertion in 
pilC2 abolished PilC expression, whereas insertional 
inactivation of pilCl did not abolish PilC expression 

10 further argues that pilCl is translationally out of frame 
and pilC2 translationally in frame in the MSll variant we 
are studying. 

PCR amplified fragments of pilCl and pilC2 in 
MSll differed in the number of G residues found in the G 

15 tract. Only 11 or 12 Gs were found in pilCl clones 

(which would both generate an OFF phenotype) while 12 or 

13 Gs were found among pilC2 specific clones. Since 
pilC2 is the expressed gene in the MSll variant under 
study, we believe that this variant carries 13 Gs in 

20 pilC2 and 12 Gs in pilCl. The frequency of frameshift 

mutations in each locus is not known. However, the lack 
of 13 Gs among pilCl specific fragments and the lack of 

14 Gs among p±lC2 specific fragments suggests that a 
deletion of one G residue occurs at a higher frequency 

25 than the insertion of one G residue. We had expected to 
find amplified fragments from N. gonorrhoeae containing 
10 G residues in the G tract, but found none in the 48 
clones sequenced. If only one G is added or deleted in 
each mutational event, the frequency of G tracts with 10 

3 0 residues should be low if G tracts normally are 12 or 13 
bp long. 

Frameshif ting in pilCl also occurred in E. 
coli. In this case, however, two variants with 10 
residues were found out of 12 clones sequenced. It may 
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therefore be that there is a selection against in frame 
variants with 10 Gs in N. gonorrhoeae. A change from 
five glycines to four in the signal peptide may for 
example have an effect on the physical properties of the 
precursor form of Pile such that the signal peptide is 
not cleaved off. b. coli strain AA10 is recA. 
Therefore, frameshift mutations in the G tract of pile 
occurs independent of the RecA protein. 

Translational f rameshif ting has been shown to 
regulate phase and antigenic variation of the gonococcal 
opacity protein PII that is encoded by a number of opa 
loci showing sequence variations. In this system a 
number of pentameric CTCTT repeats are present in the 
region encoding the signal peptide (Stern et al. 1986). 
Variation in the number of repeats is independent of recA 
in N. gonorrhoeae as well as in E. coll (Murphy et al . , 
1989) . Variation in the expression of lipopoly saccharide 
epitopes in Haemophilus Influenzae was recently explained 
by translational f rameshif ting created by alterations in 
the number of CAAT repeats occurring in the 5' end of 
licA (Weiser et al., 1989). In Bordetella pertussis 
frameshift mutations in the regulatory vir locus occur in 
a run of C residues positioned internally in the gene 
(Stibitz et al., 1989). The C tract was in this case 
varying from 6 (in frame) to 7 residues (out of frame) . 
It is not known if this frameshift mutation is programed 
or not. The pilin gene of Bordetella pertussis was 
recently shown to be preceded by a stretch of Cs. 
Frequent mutations affecting the length of this C tract 
influenced the transcriptional activity of the pilin gene 
(Willems et al . , 1990). 

Variation in the number of the CTCTT repeats in 
opa genes was recently suggested to be due to 
recombination- independent slipped strand mispairing 
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(Murphy et al . , 1989). Mispairing is thought to occur 
between strands subjected to local denaturation and 
should preferentially occur during replication. A number 
of unusual DNA structures (cruciform, 2 form, H form) 
5 have been shown to form in vitro within a variety of 
specific DNA sequences. Under normal conditions the B 
form is the most favorable thermodynamically ( Frank - 
Kamenetslcii and Vologodskii, 1984) . Transition to 
alternative conformations requires specific external 

10 conditions, supercoiling being the most physiologic. 
Single stranded <dG) n and (dC) n strands renature more 
slowly than complementary strands with arbitrary 
sequences, and methylation experiments suggest that a 
poly dG chain may form a hairpin- like structure 

15 stabilized by G-G bp (Panyutin et al., 1990). 

Four variant sequences differing outside the G 
tract were obtained by PCR amplif ication of the 5' end of 
pile from four N. gonorrhoeae strains. The region 5' of 
the G tract was invariant, as was the 3' end of the 

20 amplified region. All variation was confined to a region 
located 3' of the G tract. At least some of these 
sequence variations can be explained by mismatch pairing 
events. Thus, the addition of four nucleotides distal to 
the G tract in variant sequence 4 is possible to explain 

25 by a two step mispairing event occurring within variant 
sequence 2. Slip strand mispairing between the two CA 
residues in -GGCGCAGGCGCA- would yield 
-GGCGCAGGCGCAGGCGCA- . A second mispairing event 
occurring between the two C- residues at positions 3 and 5 

30 gives rise to the sequence -GGCAGGCGCAGGCGCA- present in 
variant 4 . It may therefore be that a sequence close to 
a poly(G) tract is prone to slipped strand mispairing. 

Gonococcal pilus phase variation is associated 

^ with an altered nucleotide sequence of pilE via 
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recombinations with silent pilS sequences (Haas and 
Meyer, 1986, Swanson et al., 1986). An irreversible 
switch OFF in pilus expression results from deletions of 
the 5' coding and control regions of the pilE locus 
5 (Swanson et al., 1985). Reversible gonococcal pilus 

phase variation is associated with nucleotide changes in 
pilB resulting in an altered pilin product. It has been 
suggested that the pilins of these variants are assembly 
defective (Bergstrom et al., 1986; Swanson et al., 1986; 
10 Hill et al., 1990). Here we present evidence that switch 
OFF and ON of Pile expression causes pilus expression to 
phase vary. Five out of five P", pilin producing 
descendants from MSll^ (P + , Piic + ) that expressed pilin 
did not express Pile. All tested P + revertants from the 
15 five P~, Pile" variants had regained expression of Pile. 
The pilin of one nonpiliated Pile OFF- switcher (variant 
8) differed by eight amino acids from that of the parent. 
The fact that one piliated Pile* backswitcher (8:1) 
expressed a pilin identical in sequence to the 
20 nonpiliated variant (8) strongly suggests that the 

regained expression of pili is due to an ON- switch in 
PilC expression. The above results also imply that the 
nonpiliated phenotype of variant 8 is not due to the 
alterations in the pilin relative to the parental strain 
25 but to an OFF- switch of Pile. The finding that mTnCm 
insertions resulted in P + colonies when inserted into 
pilCl and P" colonies when inserted into the actively 
expressing pilC2 locus offers further evidence that PilC 
is essential for the biogenesis of gonococcal pili. p - , 
pilC2: :mTnCm-l2 insertion mutants reverted to P + colony 
morphology at a low frequency. These revertants most 
likely represent f rameshif ting mutants in pilCl resulting 
in expression of Pile from this locus. A double mutant 
in pilCl and pilC2 was stably nonpiliated, expressed 
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pilin, but expressed amounts of pilin that did not 
express any pili when examined by transmission electron 
microscopy. It is therefore believed that out of frame 
mutations of both pilCl and pilC2 will abolish pili 
5 formation. 

At this stage we cannot exclude the possibility 
that some Pile" variants from MSII^ (P + , PilC" ) are 
generated by transformation of pilCl sequences and 
homologous recombination with pilC2 thus generating 

10 variants with two pilCl 5' ends at both pile loci. PilC + 
revertants from PilC" clones must, however, all be due to 
frameshift mutations in either pilCl or pilC2. 

We propose that PilC forms an outer membrane 
pore or assembly center enabling the pilin subunits to be 

15 assembled and translocated across the outer membrane 

analogous to the proposed function of the high molecular 
weight proteins required for the assembly of 
enterobacterial pili (the latter of which is discussed in 
Norgren et al. f 1987). Alternatively, PilC may act as an 

20 initiator for polymerization. In the latter case PilC 
would be expected to be located at the tip of the 
polymerized pilus. 

It is possible that the alternate expression of 
PilC from two structurally different pile loci is yet 

25 another example of antigenic variation in Neisseria, 
g-onorrhoeae . It is, however, possible that this 
variation could have functional implications as well. 
Each class of E. coli pili utilizes a different outer 
membrane pore/assembly protein. Hence, pilin subunits 

3 0 and/or periplasmic chaperone complexes may specifically 
interact with an exposed region of the protein allowing 
polymerization of pilus subunit proteins. The repertoire 
of antigenic variants of gonococcal pilins is vast 
(Hagblom et al., 1985). It may be that only certain 
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pilin variants are assembled via PilCl and PilC2 
respectively. Alternatively, if Pi i c acts as an 
initiator it could also possess other properties such as 
being involved in Pilus mediated attachment. 

In one embodiment of the invention, 
immunogenically active polypeptides encoded within pile 
are prepared. The availability of pile DNA sequences, 
either those isolated by utilizing the DNA sequences 
described in the Examples, or nucleotide sequences 
derived therefrom (including segments and modifications 
of the sequence) , permits the construction of expression 
vectors encoding immunologically reactive regions of the 
polypeptide encoded in either strand. Immunological 
reactivity may be determined by immunoassay using 
15 antibodies raised to Pile. Fragments encoding the 

desired polypeptides are derived from the DNA clones 
using conventional restriction digestion or by synthetic 
methods, and are ligated into vectors which may, for 
example, contain portions of fusion sequences such as 
beta-galactosidase or superoxide dismutase (SOD) , prefer- 
ably SOD. Methods and vectors which are useful for the 
production of polypeptides which contain fusion sequences 
of SOD are described in European Patent Office 
Publication number 0196056, published October i, 1986 
Any desired portion of the pile DNA containing an open 
reading frame, in either sense strand, can be obtained as 
a recombinant polypeptide, such as a mature or fusion 
protein; alternatively, a polypeptide encoded in the DNA 
can be provided by chemical synthesis. 

The DNA encoding the desired polypeptide 
whether in fused or mature form, and whether or not 
containing a signal sequence to permit secretion, may be 
ligated into expression vectors suitable for any 
3s convenient host. Both eukaryotic and prokaryotic host 
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systems are presently used in forming recombinant 
polypeptides, and a summary of some of the more common 
control systems and host cell lines is given infra. The 
polypeptide is then isolated from lysed cells or from the 
culture medium and purified to the extent needed for its 
intended use. Purification may be by techniques known in 
the art, for example, differential extraction, salt 
fractionation, chromatography on ion exchange resins, 
affinity chromatography, centrif ugation, and the like. 
See, for example, Methods in Enzymology for a variety of 
methods for purifying proteins. Such polypeptides can be 
used as diagnostics, or those which give rise to 
neutralizing antibodies may be formulated into vaccines. 
Antibodies raised against these polypeptides can also be 
used as diagnostics, or for passive immunotherapy. 

The Pile antigens may also be isolated from 
meningococci and from gonococci. The bacteria may be 
grown by conditions known in the art, some of which are 
described infra. In addition, a method for isolating 
PilC from gonococci is described infra. 

In another embodiment of the invention, the 
immunoreactive polypeptides may be conjugated with 
carrier. An antigenic region of a polypeptide is gener- 
ally relatively small - -typically 8 to 10 amino acids or 
less in length. Fragments of as few as 5 amino acids may 
characterize an antigenic region. These segments may 
correspond to regions of PilC antigen. Accordingly, 
using the DNAs of pxlC as a basis, DNAs encoding short 
segments of PilC polypeptides can be expressed 
recombinantly either as fusion proteins, or as isolated 
polypeptides. In addition, short amino acid sequences 
can be conveniently obtained by chemical synthesis. In 
instances wherein the synthesized polypeptide is 
correctly configured so as to provide the correct 
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epitope, but is too small to be immunogenic, the 
polypeptide may be linked to a suitable carrier. 

A number of techniques for obtaining such link- 
age are known in the art, including the formation of 
disulfide linkages using 

N- succinimidyl- 3 - (2-pyridyl-thio) propionate (SPDP) and 
succinimidyl 

4- W-maleimido-methyDcyclohexane-i-carboxylate (SMCC) 
obtaxned from Pierce Company, Rockford, Illinois, (if the 
peptide lacks a sulfhydryl group, this can be provided by 
addition of a cysteine residue.) These reagents create a 
disulfide linkage between themselves and peptide cysteine 
residues on one protein and an amide linkage through the 
epsilon-amino on a lysine, or other free amino group in 
the other. A variety of such disulf ide/amide- forming 
agents are known. See, for example, Immun. Rev. (i 98 2) 
£2:185. other Afunctional coupling agents form a 
thioether rather than a disulfide linkage. Many of these 
thio-ether- forming agents are commercially available and 
include reactive esters of 6-maleimidocaproic acid, 
2-bromoacetic acid, 2-iodoacetic acid, 

4-(N-maleimido-methyl)cyclohexane-l-carboxylic acid, and 
the like. The carboxyl groups can be activated by 
combining them with succinimide or 
l-hydroxyl- 2 -nitro-4-sulfonic acid, sodium salt 
Additional methods of coupling antigens employs the 
rotavirus/ "binding peptide" system described in EPO Pub 
No. 259,149, the disclosure of which is incorporated 
herein by reference. The foregoing list is not meant to 
be exhaustive, and modifications of the named compounds 
can clearly be used. 

Any carrier may be used which does not itself 
induce the production of antibodies harmful to the host 
3s Suitable carriers are typically large, slowly metabolized 
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macromolecules such as proteins; polysaccharides, such as 
latex functionalized sepharose, agarose, cellulose, cel- 
lulose beads and the like; polymeric amino acids, such as 
polyglutamic acid, polylysine, and the like; amino acid 
copolymers; and inactive virus particles or attenuated 
bacteria of other strains, for example, those of 
Salmonella. Especially useful protein substrates are 
serum albumins, keyhole limpet hemocyanin, immunoglobulin 
molecules, thyroglobulin, ovalbumin, tetanus toxoid, and 
other proteins well known to those skilled in the art. 

In addition to full-length Pile proteins, 
polypeptides comprising truncated PilC amino acid 
sequences encoding at least one immunologically reactive 
epitope are useful immunological reagents- For example, 
polypeptides comprising such truncated sequences can be 
used as reagents in an immunoassay. These polypeptides 
also are candidate subunit antigens in compositions for 
antiserum production or vaccines. While these truncated 
sequences can be produced by various known treatments of 
native bacterial protein, it is generally preferred to 
make synthetic or recombinant polypeptides comprising a 
PilC sequence. Polypeptides comprising these truncated 
PilC sequences can be made up entirely of PilC sequences 
(one or more epitopes, either contiguous or 
noncontiguous) , or Pile sequences and heterologous 
sequences in a fusion protein. Useful heterologous 
sequences include sequences that provide for secretion 
from a recombinant host, enhance the immunological 
reactivity of the PilC epitope(s), or facilitate the 
coupling of the polypeptide to an immunoassay support or 
a vaccine carrier. See, e.g., EPO Pub. No, 116,201; 
U.S. Pat. No. 4,722,840; EPO Pub. No. 259,149; U.S. 
Pat. No. 4,629,783, the disclosures of which are 
incorporated herein by reference. 
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The size of polypeptides comprising the 
truncated Pile sequences can vary widely, the minimum 
size being a sequence of sufficient size to provide an 
immunologically reactive Pile epitope, while the maximum 
5 size is not critical. For convenience, the maximum size 
usually is not substantially greater than that required 
to provide the desired Pile epitopes and function (s) of 
the heterologous sequence, if any. Typically, the 
truncated Pile amino acid sequence will range from about 
10 5 to about 100 amino acids in length. More typically, 

however, the Pile sequence will be a maximum of about 50 
amino acids in length, preferably a maximum of about 30 
amino acids, it is usually desirable to select Pile 
sequences of at least about 10, 12 or 15 amino acids, up 
15 to a maximum of about 20 or 25 amino acids. 

Truncated Pile amino acid sequences comprising 
epitopes can be identified in a number of ways. For 
example, the entire Pile protein sequence can be screened 
by preparing a series of short peptides that together 
span the entire protein sequence. By starting with, for 
example, lOOmer polypeptides, it would be routine to test 
each polypeptide for the presence of epitope (s) showing a 
desired reactivity, and then testing progressively 
smaller and overlapping fragments from an identified 
25 lOOmer to map the epitope of interest. Screening such 
peptides in an immunoassay is within the skill of the 
art. It is also known to carry out a computer analysis 
of a protein sequence to identify potential epitopes, and 
then prepare oligopeptides comprising the identified 
regions for screening. 

In another embodiment of the invention, the 
immunogenicity of the epitopes of Pile may also be 
enhanced by preparing them in mammalian or yeast systems 
fused with or assembled with particle- forming proteins 
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such as, for example, that associated with hepatitis B 
surface antigen. See, e.g., U.S. Pat. No. 4,722,840. 
Constructs wherein the Pile epitope is linked directly to 
the particle- forming protein coding sequences produce 
hybrids which are immunogenic with respect to the Pile 
epitope. in addition, all of the vectors prepared 
include epitopes specific to HBV, having various degrees 
of immunogenicity, such as, for example, the pre-S 
peptide. Thus, particles constructed from particle 
forming protein which include Pile sequences are 
immunogenic with respect to the microorganism encoding 
the Pile epitope (for example, Neisseria, Vibrio, 
Moraxella, Bacteroides, or Pseudomonas) and HBV. 

Hepatitis surface antigen (HBSAg) has been 
shown to be formed and assembled into particles in S. 
cerevisiae (Valenzuela et al. (1982)), as well as in, for 
example, mammalian cells (Valenzuela, P., et al. (1984)). 
The formation of such particles has been shown to enhance 
the immunogenicity of the monomer subunit. The 
constructs may also include the immunodominant epitope of 
HBSAg, comprising the 55 amino acids of the presurface 
(pre-S) region. Neurath et al. (1984). Constructs of 
the pre-S -HBSAg particle expressible in yeast are 
disclosed in EPO 174,444, published March 19, 1986; 
hybrids including heterologous viral sequences for yeast 
expression are disclosed in EPO 175,261, published March 
26, 1966. These constructs may also be expressed in 
mammalian cells such as Chinese hamster ovary (CHO) cells 
using an SV40-dihydrof olate reductase vector (Michelle et 
30 al. (1984) ) . 

In addition, portions of the particle- forming 
protein coding sequence may be replaced with codons 
encoding a Pile epitope. m this replacement, regions 
which are not required to mediate the aggregation of the 
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units to form immunogenic particles in yeast or mammals 
can be deleted, thus eliminating additional HBV antigenic 
sites from competition with the PilC epitope. 

In another embodiment of the invention, the 
immunoreactive polypeptides encoded in pile are prepared 
into vaccines. Vaccines may be prepared from one or more 
immunogenic polypeptides derived from pile. If 
recombinant, these polypeptides may be expressed in 
various host cells (e.g., bacteria, yeast, insect, or 
mammalian cells) , or alternatively may be isolated from 
the bacterial preparations. In addition to the above, it 
is also possible to prepare live vaccines of attenuated 
microorganisms which express one or more recombinant 
polypeptides derived from the pile gene. Suitable 
15 attenuated microorganisms are known in the art and 

include, for example, viruses (e.g., vaccinia virus (see 
Brown et al. (1986)), as well as bacteria. 

The preparation of vaccines which contain an 
immunogenic polypeptide (s) as active ingredients, is 
20 known to one skilled in the art. Typically, such 

vaccines are prepared as in jec tables, either as liquid 
solutions or suspensions; solid forms suitable for solu- 
tion in, or suspension in, liquid prior to injection may 
also be prepared. The preparation may also be 
25 emulsified, or the protein encapsulated in liposomes. 

The active immunogenic ingredients are often mixed with 
excipients which are pharmaceutically acceptable and 
compatible with the active ingredient. Suitable 
excipients are, for example, water, saline, dextrose, 
glycerol, ethanol, or the like and combinations thereof. 
In addition, if desired, the vaccine may contain minor 
amounts of auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, and/or adjuvants 
which enhance the effectiveness of the vaccine. Examples 
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of adjuvants which may be effective include but are not 
limited to: aluminum hydroxide, N- acetyl -muramyl - 
L- threonyl-D-isoglutamine (thr-MDP) , N-acetyl -nor- 
muramyl -L-alanyl-D-isoglutamine (CGP 11637, referred to 
5 as nor-MDP) , N-acetylmuramyl-L-alanyl -D- isoglutaminyl - 

L-alanine-2- (l'-2' -dipalmitoyl-sn-glycero-3-hydroxyphosph 
oryloxy) -ethylamine (CGP 19835A, referred to as MTP-PE) , 
and RIBI, which contains three components extracted from 
bacteria, monophosphoryl lipid A # trehalose dimycolate 

10 and cell wall skeleton (MPL+TDM+CWS) in a 2% 

squalene/Tween 8 0 emulsion. The effectiveness of an 
adjuvant may be determined by measuring the amount of 
antibodies directed against an immunogenic polypeptide 
containing a PilC immunoreactive sequence resulting from 

15 administration of this polypeptide in vaccines which are 
also comprised of the various adjuvants. 

The vaccines are conventionally administered 
parenterally, by injection, for example, either 
subcutaneously or intramuscularly. Additional 

20 formulations which are suitable for other modes of 

administration include suppositories and, in some cases, 
oral formulations. For suppositories, traditional 
binders and carriers may include, for example, 
polyalkylene glycols or triglycerides; such suppositories 

25 may be formed from mixtures containing the active 

ingredient in the range of 0.5% to 10%, preferably l%-2%. 
Oral formulations include such normally employed 
excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodium 

3 0 saccharine, cellulose, magnesium carbonate, and the like. 
These compositions take the form of solutions, suspen- 
sions, tablets, pills, capsules, sustained release 
formulations or powders and contain 10% -95% of active 
ingredient, preferably 25%- 70%. 

35 
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The proteins may be formulated into the vaccine 
as neutral or salt forms. Phannaceutically acceptable 
salts include the acid addition salts (formed with free 
amino groups of the peptide) and which are formed with 
5 inorganic acids such as, for example, hydrochloric or 
phosphoric acids, or such organic acids such as acetic, 
oxalic, tartaric, maleic, and the like. Salts formed 
with the free carboxyl groups may also be derived from 
inorganic bases such as, for example, sodium, potassium, 
10 ammonium, calcium, or ferric hydroxides, and such organic 
bases as isopropylamine, t rime thyl amine , 2-ethylamino 
ethanol, histidine, procaine, and the like. 

The vaccines are administered in a manner 
compatible with the dosage formulation, and in such 
amount as will be prophylactically and/ or therapeutically 
effective. The quantity to be administered, which is 
generally in the range of 5 micrograms to 250 micrograms 
of antigen per dose, depends on the subject to be 
treated, capacity of the subject's immune system to 
synthesize antibodies, and the degree of protection 
desired. Precise amounts of active ingredient required 
to be administered may depend on the judgment of the 
practitioner and may be peculiar to each subject. 

The vaccine may be given in a single dose 
25 schedule, or preferably in a multiple dose schedule. A 
multiple dose schedule is one in which a primary course 
of vaccination may be with l-io separate doses, followed 
by other doses given at subsequent time intervals 
required to maintain and or reenforce the immune 
3 0 response, for example, at 1-4 months for a second dose, 

and if needed, a subsequent dose(s) after several months. 
The dosage regimen will also, at least in part, be 
determined by the need of the individual and be dependent 
upon the judgment of the practitioner. 
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In addition, the vaccine containing the im- 
munogenic antigen (s) derived from pile may be 
administered in conjunction with other immunoregulatory 
agents, for example, immune globulins. 
5 Another embodiment of the invention are 

antibodies which react immunologically with PilC 
epitopes. The immunogenic polypeptides prepared as 
described above are used to produce antibodies, including 
polyclonal and monoclonal. If polyclonal antibodies are 

10 desired, a selected mammal (e.g., mouse, rabbit, goat, 

horse, etc.) is immunized with an immunogenic polypeptide 
bearing a PilC epitope (s) . Serum from the immunized 
animal is collected and treated according to known 
procedures. If serum containing polyclonal antibodies to 

15 a PilC epitope (i.e., an epitope encoded within pilC) 
contains antibodies to other antigens, the polyclonal 
antibodies can be purified by immunoaf f inity 
chromatography. Techniques for producing and processing 
polyclonal antisera are known in the art, see for 

20 example, Mayer and Walker (19 87) . 

Monoclonal antibodies directed against PilC 
epitopes can also be readily produced by one skilled in 
the art. The general methodology for making monoclonal 
antibodies by hybridomas is well known. Immortal 

25 antibody -producing cell lines can be created by cell 
fusion, and also by other techniques such as direct 
transformation of B lymphocytes with oncogenic DNA, or 
transfection with Epstein-Barr virus. See, e.g., M. 
Schreier et al . (1980); Hammerling et al . (1981); Kennett 

30 et al. (1980); 3££ alfiffl, U.S. Patent Nos. 4,341,761; 

4,399,121; 4,427,783; 4,444,887; 4,466,917; 4,472,500; 
4,491,632; and 4,493,890. Panels of monoclonal 
antibodies produced against PilC epitopes can be screened 
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for various properties; i.e., for isotype, epitope affin- 
ity, etc. 

Antibodies, both monoclonal and polyclonal, 
which are directed against Pile epitopes are particularly 
5 useful in diagnosis, and those which are neutralizing are 
useful in passive immunotherapy. Methods for introducing 
antibodies into an individual to accomplish passive 
immunotherapy are known in the art. In addition, 
monoclonal antibodies, in particular, may be used to 
10 raise anti- idiotype antibodies. 

Anti-idiotype antibodies are immunoglobulins 
which carry an "internal image" of the antigen of the 
infectious agent against which protection is desired. 
See, for example, Nisonoff, A., et al. (1981) and 
15 Dreesman et al . (1985). Techniques for raising 

anti-idiotype antibodies are known in the art. See, for 
example, Grzych (1985), MacNamara et al. (1984), and 
Uytdehaag et al. (1985). These anti-idiotype antibodies 
may also be useful for treatment, vaccination and/or 
20 diagnosis of the relevant microorganism encoding the 
antigen of interest, (for example, Neisseria, 
Pseudomonas, Moraxella, Bacteroides, or Vibrio) as well 
as for an elucidation of the immunogenic regions of Pile. 

Another embodiment of the invention concerns 
25 immunoassays and diagnostic kits. The polypeptides which 
contain epitopes encoded in pile which are immunoreactive 
with anti -Pile antibodies in biological samples are 
useful in immunoassays to detect presence of anti -Pile 
antibodies, or the presence of the relevant microorganism 
or its antigens in biological samples. Design of the 
immunoassays is subject to a great deal of variation, and 
many formats are known in the art. The immunoassay will 
utilize a polypeptide comprised of at least one epitope 
derived from Pile or encoded in pile. In one embodiment, 
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the immunoassay uses a combination of epitopes including 
the one derived from Pile or encoded in pile. These 
epitopes may be derived from the same or from different 
polypeptides, and may be in separate recombinant or 
natural polypeptides, or together in the same recombinant 
polypeptides. An immunoassay may use, for example, a 
monoclonal antibody directed towards an epitope (s), a 
combination of monoclonal antibodies directed towards 
epitopes of one antigen, monoclonal antibodies directed 
towards epitopes of different antigens, polyclonal 
antibodies directed towards the same antigen, or 
polyclonal antibodies directed towards different 
antigens. Protocols may be based, for example, upon 
competition, or direct reaction, or sandwich type assays. 
Protocols may also, for example, use solid supports, or 
may be by immunoprecipitation. Most assays involve the 
use of labeled antibody or polypeptide; the labels may 
be, for example, enzymatic, fluorescent, 

chemiluminescent , radioactive, or dye molecules. Assays 
which amplify the signals from the probe are also known; 
examples of which are assays which utilize biotin and 
avidin, and enzyme- labeled and mediated immunoassays, 
such as ELISA assays. 

Typically, an immunoassay for an anti-PilC 
antibody (s) will involve selecting and preparing the test 
sample suspected of containing the antibodies, such as a 
biological sample, then incubating it with an 
immunoreactive (also called antigenic) polypeptide (s) 
containing at least one epitope encoded in pile. The 
incubation is under conditions that allow 
antigen-antibody complexes to form. Suitable incubation 
conditions are well known in the art. Subsequent to the 
incubation, complexes which are formed which contain the 
immunoreactive polypeptide are detected. The immunoassay 
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may be, without limitations, in a heterogenous or in a 
homogeneous format, and of a standard or competitive 
type. 

In a heterogeneous format, the polypeptide is 
typically bound to a solid support to facilitate separa- 
tion of the sample from the polypeptide after incubation. 
Examples of solid supports that can be used are nitro- 
cellulose (e.g., in membrane or microtiter well form), 
polyvinyl chloride (e.g., in sheets or microtiter wells), 
polystyrene latex (e.g., in beads or microtiter plates, 
polyvinylidine fluoride (known as Immulon) , diazotized 
paper, nylon membranes, activated beads, and Protein A 
beads. For example, Dynatech Immulon 1 or Immulon 2 
microtiter plates or 0.25 inch polystyrene beads (Preci- 
sion Plastic Ball) can be used in the heterogeneous 
format. The solid support containing the antigenic 
polypeptide is typically washed after separating it from 
the test sample, and prior to detection of bound antibod- 
ies. Both standard and competitive formats are known in 
the art. 

In a homogeneous format, the test sample is 
incubated with antigen in solution. For example, it may 
be under conditions that will precipitate any 
antigen-antibody complexes which are formed. Both 
standard and competitive formats for these assays are 
known in the art. 

In a standard format, the amount of anti-Pile 
antibodies forming the antibody -antigen complex is 
directly monitored. This may be accomplished by 
determining whether labeled ant i -xenogeneic (e.g., 
anti -human) antibodies which recognize an epitope on 
anti-Pile antibodies will bind due to complex formation. 
In a competitive format, the amount of anti -Pile 
antibodies in the sample is deduced by monitoring the 
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competitive effect on the binding of a known amount of 
labeled antibody (or other competing ligand) in the 
complex. 

Complexes formed comprising anti-PilC antibody 
5 (or, in the case of competitive assays, the amount of 
competing antibody) are detected by any of a number of 
known techniques, depending on the format. For example, 
unlabeled anti-PilC antibodies in the complex may be 
detected using a conjugate of antixenogeneic Ig complexed 

10 with a label, (e.g., an enzyme label). 

In immunoassays where PilC polypeptides are the 
analyte, the test sample, which may be a biological 
sample, is incubated with anti-PilC antibodies under 
conditions that allow the formation of antigen -antibody 

IS complexes. Various formats can be employed. For 
example, a "sandwich assay" may be employed, where 
antibody bound to a solid support is incubated with the 
test sample; washed; incubated with a second, labeled 
antibody to the analyte, and the support is washed again. 

20 Analyte is detected by determining if the second antibody 
is bound to the support. In a competitive format, which 
can be either heterogeneous or homogeneous, a test sample 
is usually incubated with antibody and a labeled, 
competing antigen is also incubated, either sequentially 

25 or simultaneously. These and other formats are well 
known in the art. 

The antigenic regions of the polypeptides 
encoded in pile can be mapped and identified by screening 
the antigenicity of expression products of pile DNAs 

30 which encode portions of the PilC, The expression 

products may be from a variety of expression systems, 
including, for example bacterial systems, yeast systems, 
insect systems, and eukaryotic cell systems. In 
addition, studies giving rise to an antigenicity index 
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and hydrophobicity/hydrophilicity profile give rise to 
information concerning the probability of a region's 
antigenicity. 

Efficient detection systems for infection with 
pathogenic microorganisms, (for example, Neisseria, 
PseudomonsLS , Bacteroides, Moraxella, or Vibrio) may 
include the use of panels of epitopes. The epitopes in 
the panel may be constructed into one or multiple 
polypeptides. At least one of the epitopes will be 
encoded in pile or derived from PilC. The assays for the 
varying epitopes may be sequential or simultaneous. 

Kits suitable for immunodiagnosis and contain- 
ing the appropriate labeled reagents are constructed by 
packaging the appropriate materials, including the 
polypeptides of the invention containing PilC epitopes 
(i.e., epitopes encoded within pilC) or antibodies 
directed against PilC epitopes in suitable containers. 
The kit may also contain other reagents, for example, 
buffer and standard, as well as other materials required 
for the conduct of the assay, as well as a suitable set 
of instructions for conducting the assay using the kit 
materials. 

Another embodiment of the invention are 
oligomers. Using the disclosed portions of the pilC DNAs 
as a basis, oligomers of approximately 8 nucleotides or 
more can be prepared, either by excision from recombinant 
polynucleotides or by synthetic methods which are known 
in the art. These oligomers can serve as probes for the 
detection (including isolation and/or labeling) of 
polynucleotides which contain pile sequences, and/or as 
primers for the transcription and/or replication of 
targeted pilC sequences. The oligomers contain a 
targeting polynucleotide sequence, which is comprised of 
nucleotides which are complementary to a target pilC 
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nucleotide sequence; the sequence is of sufficient length 
and complementarity with the pile sequence to form a 
duplex which has sufficient stability for the purpose 
intended. For example, if the purpose is the isolation, 
via immobilization, of an analyte containing a target 
pilC sequence, the oligomers would contain a 
polynucleotide region which is of sufficient length and 
complementarity to the targeted pile sequence to afford 
sufficient duplex stability to immobilize the analyte on 
a solid surface, via its binding to the oligomers, under 
the isolation conditions. For example, also, if the 
oligomers are to serve as primers for the transcription 
and/or replication of target pile sequences in an analyte 
polynucleotide, the oligomers would contain a 
polynucleotide region of sufficient length and 
complementarity to a region flanking the targeted pile 
sequence to allow the polymerizing agent to continue 
replication from the primers which are in stable duplex 
form with the target sequence, under the polymerizing 
conditions. The oligomers may contain a minimum of about 
4 contiguous nucleotides which are complementary to a 
targeted pile sequence; usually the oligomers will 
contain a minimum of about 8 contiguous nucleotides which 
are complementary to the targeted pile sequence, and 
preferably will contain a minimum of about 14 contiguous 
nucleotides which are complementary to the targeted pile 
sequence. 

The oligomer, however, need not consist only of 
the sequence which is complementary to the targeted pile 
sequence. It may contain in addition, nucleotide 
sequences or other moieties which are suitable for the 
purposes for which the oligomers are used. For example, 
if the oligomers are used as primers for the 
amplification of targeted pile sequences via the 
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polymerase chain reaction (PCR) , they may contain 
sequences which, when in duplex, form restriction enzyme 
sites which facilitate the cloning of the amplified 
sequences. Other types of moieties or sequences which 
5 are useful of which the oligomers may be comprised or 
coupled to, are those which are known in the art to be 
suitable for a variety of purposes, including the 
labeling of nucleotide probes. 

In the basic nucleic acid hybridization assay, 
10 single- stranded analyte nucleic acid (either DNA or rna) 
is hybridized to a nucleic acid probe, and resulting 
duplexes are detected. The probes for pile sequences 
(natural or derived) are a length which allows the 
detection of these sequences by hybridization. While 
15 6-8 nucleotides may be a workable length, sequences of 

10-12 nucleotides are preferred, and about 20 nucleotides 
or more appears optimal. Preferably, these sequences 
will derive from regions which lack heterogeneity. These 
probes can be prepared using routine methods, including 
20 automated oligonucleotide synthetic methods. For use as 
probes, complete complementarity is desirable, although 
it may be unnecessary as the length of the fragment is 
increased. 

For use of such probes as agents to detect the 
25 presence of pile sequences, the sample to be analyzed 

(which may be biological) may be treated, if desired, to 
extract the nucleic acids contained therein. The 
resulting nucleic acid from the sample may be subjected 
to gel electrophoresis or other size separation 
techniques; alternatively, the nucleic acid sample may be 
dot blotted without size separation. In order to form 
hybrid duplexes with the targeting sequence of the probe, 
the targeted region of the analyte nucleic acid must be 
in single- stranded form. The latter may occur naturally; 
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alternatively, it may be accomplished by denaturation. 
Denaturation can be accomplished by various techniques 
known in the art. Subsequent to denaturation, the 
analyte nucleic acid and probe are incubated under 
5 conditions which promote stable hybrid formation of the 
target sequence in the probe with the putative targeted 
sequence in the analyte, and the resulting duplexes 
containing probe (s) are detected. 

Detection of the resulting duplex, if any, is 
10 usually accomplished by the use of labeled probes; 

alternatively, the probe may be unlabeled, but may be 
detectable by specific binding with a ligand which is 
labeled, either directly or indirectly. Suitable labels, 
and methods for labeling probes and ligands are known in 
15 the art, and include, for example, radioactive labels 
which may be incorporated by known methods (e.g., nick 
translation or kinasing) , biotin, fluorescent groups, 
chemiluminescent groups (e.g., doxetanes, particularly 
triggered dioxetands) , enzymes, antibodies, and the like. 
20 Variations of this basic scheme are known in the art. 

If the targeted pile sequences are expected to 
be present at relatively low levels, amplification may be 
required for their detection. Such techniques are known 
in the art. For example, the Enzo Biochemical 
25 Corporation "Bio- Bridge" system uses terminal 
deoxynucleotide transferase to add unmodified 
3' -poly-dT- tails to a DNA probe. The poly dT-tailed 
probe is hybridized to the target nucleotide sequence, 
and then to a biotin-modif ied poly-A. PCT application 
10 84/03520 and EPA124221 describe a DNA hybridization 
assay in which: (1) analyte is annealed to a 
single- stranded DNA probe that is complementary to an 
enzyme- labeled oligonucleotide; and (2) the resulting 
tailed duplex is hybridized to an enzyme -labeled 
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oligonucleotide. EPA 204510 describes a DNA 
hybridization assay in which analyte DNA is contacted 
with a probe that has a tail, such as a poly-dT tail, an 
amplifier strand that has a sequence that hybridizes to 
5 the tail of the probe, such as a poly- A sequence, and 
which is capable of binding a plurality of labeled 
strands. A particularly desirable technique may first 
involve amplification of the target pile sequences. The 
target pile sequences in sera may be amplified, for 
10 example, to approximately 10 6 sequences/ml. This may be 
accomplished, for example, by the polymerase chain 
reactions (PCR) technique described by Saiki et al. 
(1986), by Mullis, U.S. Patent No. 4,683,195, and by 
Mullis et al. U.S. Patent No. 4,683,202. Amplification 
15 may be prior to, or preferably subsequent to purification 
of the pile target sequence. For example, amplif ication 
may be utilized in conjunction with the assay methods 
described in U.S. Patent No. 4,868,105, or if even 
further amplification is desired, in conjunction with the 
20 hybridization system in EPO Publication No. 317,077. 

Generally, in the PCR technique, short 
oligonucleotide primers are prepared which match opposite 
ends of a desired sequence. The sequence between the 
primers need not be known. A sample of polynucleotide is 
25 extracted and denatured, preferably by heat, and 

hybridized with oligomers which are oligonucleotide 
primers, which are present in molar excess. 
Polymerization is catalyzed by a template- and primer- 
dependent polymerase in the presence of deoxynucleotide 
30 triphosphates (dNTPs) , and may also be in the presence of 
nucleotide analogs. This results in two "long products" 
which contain the respective primers at their 5' -termini, 
covalently linked to the newly synthesized complements of 
the original strands. The replicated DNA is again 
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denatured, hybridized with oligonucleotide primers, 
returned to polymerizing conditions, and a second cycle 
of replication is initiated. The second cycle provides 
the two original strands, the two long products from 
5 cycle 1, and two "short products" replicated from the 
long products. The short products contain sequences 
(sense or antisense) derived from the target sequence, 
flanked at the 5'- and 3' -termini with primer sequences. 
On each additional cycle, the number of short products is 

10 replicated exponentially. Thus, this process causes 
amplif ication of a specific target sequence. 

It will be understood that "primer", as used 
herein, may refer to more than one primer, particularly 
in the case where there is some ambiguity in the 

15 information regarding the terminal sequence (s) of the 
target region to be amplified. Hence, a "primer" 
includes a collection of primer oligonucleotides 
containing sequences representing the possible variations 
in the sequence or includes nucleotides which allow a 

20 typical base pairing. One of the primer oligomers in 
this collection will be homologous with the end of the 
target sequence. 

The amplified sequence (s) may then be detected 
using a hybridization assay which utilize nucleic acid 

25 mul timers which bind to single -stranded analyte nucleic 
acid, and which also bind to a multiplicity of 
single -stranded labeled oligonucleotides. A suitable 
solution phase sandwich assay which may be used with 
labeled polynucleotide probes, and the methods for the 

30 preparation of probes is described in EPO 225,807, 
published June 16, 1987. 

The probes can be packaged into diagnostic 
kits. Diagnostic kits include the probe DNA, which may 
be labeled; alternatively, the probe DNA may be unlabeled 
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and the ingredients for labeling may be included in the 
kit in separate containers. The kit may also contain 
other suitably packaged reagents and materials needed for 
the particular hybridization protocol, for example, 
standards, buffers, as well as instructions for 
conducting the test using the kit ingredients. 

The pile DNA sequence information in the clones 
described in the Examples may be used to gain further 
information on the remaining sequence of the pile gene 
from meningococci, for other possible alleles of pile in 
Neisseria, as well as pile in other relevant genuses and 
species. This information will aid in the 
characterization of the gene, and of its role in 
virulence of the pathogenic forms of microorganisms, 
15 including, for example, Neisseria, Pseudomonas , 

Bacteroides, Moraxella, and Vibrio. Moreover, this 
sequence information can lead to additional 
polynucleotide probes, polypeptides derived from pile, 
multiple pile loci, and antibodies directed against Pile 
epitopes which would be useful for the diagnosis and/or 
treatment of infections caused by the relevant pathogenic 
microorganisms . 

The DNA sequence information in the 
above-mentioned clones is useful for the design of probes 
25 for the isolation of additional DNA sequences which are 
derived from as yet undefined regions of pile. For 
example, labeled probes containing a sequence of 
approximately 8 or more nucleotides, and preferably 20 or 
more nucleotides, which are derived from regions close to 
the ends of the DNA sequences shown in the Examples. 
These probes may be used to isolate overlapping DNA 
sequences within or adjacent to pile from DNA libraries 
created from genomes of species having type 4 pilins. 
The resulting overlapping DNAs may then be used to 
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synthesize probes for identification of other overlapping 
fragments which do not necessarily overlap the DNAs whose 
sequences are given in the Examples. Thus, it is 
possible to sequence entire pile genes utilizing the DNA 
sequences provided herein and the technique of isolation 
of overlapping DNAs derived from the pile genes. 

Methods for constructing DNA libraries are 
known in the art, and are discussed infra; for example, a 
method for the construction of pile libraries in 
lambda-gtli is discussed infra in Section IV. A. However, 
DNA libraries which are useful for screening with nucleic 
acid probes may also be constructed in other vectors 
known in the art, for example, lambda-gtlO (Huynh et al . 
(1985)). Another suitable vector for the creation of 
libraries may be EMBL3 , which is a replacement vector 
which accepts inserts ranging from 9 to 23 kb in size. 
In general, methods for constructing DNA libraries is 
discussed in Maniatis et al, MOLECULAR CLONING, 2nd 
edition, (1989). 

The sequence information derived from these 
overlapping pile DNAs is useful for determining areas of 
homology and heterogeneity within the pile genets), which 
could indicate the presence of different strains 
gonococci, meningococci, or other hitherto unrecognized 
pathogenic forms of Neisseria. It is also useful for the 
design of hybridization probes to detect Pile antigens or 
pile nucleic acids in biological samples. Moreover, the 
overlapping DNAs may be used to create expression vectors 
for polypeptides derived from pile gene(s) . 

The pile DNA sequence information may also 
allow the construction of additional bacteriostatic 
agents for treatment of neisserial infections, in that 
they may block the expression of Pile and/or pilin 
assembly. For example, it may be used to derive 
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antisense polynucleotides. Antisense polynucleotides 
molecules are comprised of a complementary nucleotide 
sequence which allows them to hybridize specifically to 
designated regions of genomes or RNAs. Antisense 
5 polynucleotides may include, for example, molecules that 
will block protein translation by binding to mRNA, or may 
be molecules which prevent replication of DNA by 
replicase. They may also include molecules which carry 
agents (non- covalently attached or covalently bound) 
10 which cause the mRNA or genomic DNA to be inactive by 
causing, for example, scissions in these molecules. 
Antisense molecules which are to hybridize to pile 
derived polynucleotides may be designed based upon the 
sequence information of the pile DNA sequences provided 
15 herein, including those which would be isolated from 

additional DNA libraries. The antibacterial agents based 
upon ant i- sense polynucleotides for pile may be designed 
to bind with high specificity, to be of increased 
solubility, to be stable, and to have low toxicity. 
Hence, they may be delivered in specialized systems, for 
example, liposomes, or by gene therapy. In addition, 
they may include analogs, attached proteins, substituted 
or altered bonding between bases, etc. 

Other types of drugs may be based upon 
25 polynucleotides which "mimic" important control regions 
of the pile gene, and which may be therapeutic due to 
their interactions with key components of the system 
responsible for expression of the gene. 

In addition to the specific methods described 
30 in the Examples, general methods are known which may be 
used in the practice of the invention. For example, 
general techniques used in extracting the genome from 
bacteria, including Neisseria, preparing and probing a 
DNA library, sequencing clones, constructing expression 



20 



BN8DOCID: <WO B213e71A1JA> 




♦ 



WO 92/13871 PCT/US92/00863 



-53- 

vectors, transforming cells, performing immunological 
assays such as radioimmunoassays and ELISA assays, for 
growing cells in culture, and the like are known in the 
art and laboratory manuals are available describing these 
5 techniques. However, as a general guide, the following 
sets forth some sources currently available for such 
procedures, and for materials useful in carrying them 
out . 

Both prokaryotic and eukaryotic host cells may 

10 be used for expression of desired coding sequences when 
appropriate control sequences which are compatible with 
the designated host are used. Among prokaryotic hosts, 
E. coll is most frequently used. Expression control 
sequences for prokaryotes include promoters, optionally 

15 containing operator portions, and ribosome binding sites. 
Transfer vectors compatible with prokaryotic hosts are 
commonly derived from, for example, pBR322, a plasmid 
containing operons conferring ampicillin and tetracycline 
resistance, and the various pUC vectors, which also 

20 contain sequences conferring antibiotic resistance mark- 
ers. These markers may be used to obtain successful 
transformants by selection. Commonly used prokaryotic 
control sequences include the Beta -lactamase 
(penicillinase) and lactose promoter systems (Chang et 

25 al. (1977)), the tryptophan (trp) promoter system 
(Goeddel et al. (1980)) and the lambda -derived P T 

Li 

promoter and N gene ribosome binding site (Shimatake et 
al. (1981)) and the hybrid tac promoter (De Boer et al . 
(1983)) derived from sequences of the trp and lac UV5 
30 promoters. The foregoing systems are particularly 

compatible with E. coli; if desired, other prokaryotic 
hosts such as strains of Bacillus or Pseudomonas may be 
used, with corresponding control sequences. 
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Eukaryotic hosts include yeast and mammalian 
cells in culture systems. Saccharomyces cerevislae and 
Saccharomyces carlsbergensis are the most commonly used 
yeast hosts, and are convenient fungal hosts. Yeast 
compatible vectors carry markers which permit selection 
of successful transf ormants by conferring prototrophy to 
auxotrophic mutants or resistance to heavy metals on 
wild- type strains. Yeast compatible vectors may employ 
the 2 micron origin of replication (Broach et al. 
(1983)), the combination of CEN3 and ARSl or other means 
for assuring replication, such as sequences which will 
result in incorporation of an appropriate fragment into 
the host cell genome. Control sequences for yeast 
vectors are known in the art and include promoters for 
the synthesis of glycolytic enzymes (Hess et al. (1968); 
Holland et al. (1978)), including the promoter for 3 
phosphoglycerate kinase (Hitzeman (1980)). Terminators 
may also be included, such as those derived from the 
enolase gene (Holland (1981)). Particularly useful 
control systems are those which comprise the 
glyceraldehyde-3 phosphate dehydrogenase (GAPDH) promoter 
or alcohol dehydrogenase (ADH) regulatable promoter, 
terminators also derived from GAPDH, and if secretion is 
desired, leader sequence from yeast alpha factor. In 
addition, the transcriptional regulatory region and the 
transcriptional initiation region which are operably 
linked may be such that they are not naturally associated 
in the wild- type organism. These systems are described 
in detail in EPO 120,551, published October 3, 1984; EPO 
116,201, published August 22, 1984; and EPO 164,556, 
published December 18, 1985. 

Mammalian cell lines available as hosts for 
expression are known in the art and include many 
immortalized cell lines available from the American Type 
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Culture Collection (ATCC) , including HeLa cells, Chinese 
hamster ovary (CHO) cells, baby hamster kidney (BHK) 
cells, and a number of other cell lines. Suitable 
promoters for mammalian cells are also known in the art 
and include viral promoters such as that from Simian 
Virus 40 (SV40) (Fiers (1978)), Rous sarcoma virus (RSV) , 
adenovirus (ADV) , and bovine papilloma virus (BPV) . 
Mammalian cells may also require terminator sequences and 
poly A addition sequences; enhancer sequences which 
increase expression may also be included, and sequences 
which cause amplification of the gene may also be 
desirable. These sequences are known in the art. 

Vectors suitable for replication in mammalian 
cells are known in the art, and may include viral 
15 replicons, or sequences which insure integration of the 
appropriate sequences encoding Pile epitopes into the 
host genome. 

A vector which is used to express foreign DNA, 
and which may be used in vaccine preparation is Vaccinia 
20 virus. In this case the heterologous DNA is inserted 

into the Vaccinia genome. Techniques for the insertion 
of foreign DNA into the vaccinia virus genome are known 
in the art, and utilize, for example, homologous 
recombination. The insertion of the heterologous DNA is 
generally into a gene which is non-essential in nature, 
for example, the thymidine kinase gene (tk) , which also 
provides a selectable marker. Plasmid vectors that 
greatly facilitate the construction of recombinant 
viruses have been described (see, for example, Mackett et 
30 al. (1984), Chakrabarti et al . (1985); Moss (1987)). 
Expression of the polypeptide containing at least one 
immunoreactive Pile epitope then occurs in cells or 
individuals which are immunized with the live recombinant 
vaccinia virus . 
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Other systems for expression of desired 
polypeptides include insect cells and vectors suitable 
for use in these cells. These systems are known in the 
art, and include, for example, insect expression transfer 
5 vectors derived from the baculovirus Autographa 

California* nuclear polyhedrosis virus (AcNPV) , which is 
a helper- independent, viral expression vector. 
Expression vectors derived from this system usually use 
the strong viral polyhedrin gene promoter to drive 
10 expression of heterologous genes. Currently the most 
commonly used transfer vector for introducing foreign 
genes into AcNPV is P Ac373. Many other vectors, known to 
those of skill in the art, have also been designed for 
improved expression. These include, for example, pVL985 
15 (which alters the polyhedrin start codon from ATG to ATT 
and which introduces a BamHI cloning site 32 basepairs 
downstream from the ATT; See Luckow and Summers (1989) ) . 

Methods for the introduction of heterologous 
DNA into the desired site in the baculovirus virus are 
20 known in the art. (See Summer and Smith, Texas 

Agricultural Experiment Station Bulletin No. 1555; Ju et 
al. (1987); smith et al. (1983); and Luckow and Summers 
(1989)) . For example, the insertion can be into a gene 
such as the polyhedrin gene, by homologous recombination- 
insertion can also be into a restriction enzyme site 
engineered into the desired baculovirus gene. The 
inserted sequences may be those which encode all or 
varying segments of the polyprotein, or other orfs which 
encode viral polypeptides. For example, the insert could 
encode the following numbers of amino acid segments from 
the polyprotein: amino acids 1-1078; amino acids 332-662- 
amino acids 406-662; amino acids 156-328, and amino acids 
199-328 . 
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The signals for posttranslational 
modifications, such as signal peptide cleavage, 
proteolytic cleavage, and phosphorylation, appear to be 
recognized by insect cells. The signals required for 
5 secretion and nuclear accumulation also appear to be 

conserved between the invertebrate cells and vertebrate 
cells. Examples of the signal sequences from vertebrate 
cells which are effective in invertebrate cells are known 
in the art, for example, the human interleukin 2 signal 
10 (IL2 g ) which is a signal for transport out of the cell, 
is recognized and properly removed in insect cells. 

Recombinant polynucleotides are inserted into 
host cells by transformation. Transformation may be by 
any known method for introducing polynucleotides into a 
15 host cell, including, for example packaging the 

polynucleotide in a virus and transducing a host cell 
with the virus, and by direct uptake of the 
polynucleotide. The transformation procedure used 
depends upon the host to be transformed. Bacterial 
20 transformation by direct uptake generally employs 
treatment with calcium or rubidium chloride (Cohen 
(1972); Maniatis (1989)). Yeast transformation by direct 
uptake may be carried out using the method of Hinnen et 
al. (1978) . Mammalian transformations by direct uptake 
25 may be conducted using the calcium phosphate 

precipitation method of Graham and Van der Eb (1978), or 
the various known modifications thereof. Other methods 
for the introduction of recombinant polynucleotides into 
cells, particularly into mammalian cells, which are known 
in the art include dextran-mediated transfection, calcium 
phosphate mediated transfection, polybrene mediated 
transfection, protoplast fusion, electroporation, 
encapsulation of the polynucleotide (s) in liposomes, and 
direct microinjection of the polynucleotides into nuclei. 
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The recombinant polynucleotide may be in the 
form of a vector. Vector construction employs techniques 
which are known in the art. Site-specific DNA cleavage 
is performed by treating with suitable restriction 
enzymes under conditions which generally are specified by 
the manufacturer of these commercially available enzymes. 
In general, about 1 microgram of plasmid or DNA sequence 
is cleaved by 1 unit of enzyme in about 20 microliters 
buffer solution by incubation of 1-2 hr at 37° C. After 
incubation with the restriction enzyme, protein is 
removed by phenol /chloroform extraction and the DNA 
recovered by precipitation with ethanol . The cleaved 
fragments may be separated using polyacrylamide or 
agarose gel electrophoresis techniques, according to the 
general procedures found in Methods in Enzymology (1980) 
£1:499-560. 

Sticky ended cleavage fragments may be blunt 
ended using E. coli DNA polymerase I (Klenow) in the 
presence of the appropriate deoxynucleotide triphosphates 
(dNTPs) present in the mixture. Treatment with SI 
nuclease may also be used, resulting in the hydrolysis of 
any single stranded DNA portions. 

Ligations are carried out using standard buffer 
and temperature conditions using T4 DNA ligase and ATP; 
sticky end ligations require less ATP and less ligase 
than blunt end ligations. When vector fragments are used 
as part of a ligation mixture, the vector fragment is 
often treated with bacterial alkaline phosphatase (BAP) 
or calf intestinal alkaline phosphatase to remove the 
5' -phosphate and thus prevent religation of the vector; 
alternatively, restriction enzyme digestion of unwanted 
fragments can be used to prevent ligation. 

Ligation mixtures are transformed into suitable 
cloning hosts, such as E. coli, and successful 
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transf ormants selected by, for example, antibiotic 

resistance, and screened for the correct construction. 

The desired recombinant DNA sequences may be 

synthesized by synthetic methods. Synthetic 

5 oligonucleotides may be prepared using an automated 

oligonucleotide synthesizer as described by Warner 

(1984 ll If desired the synthetic strands may be labeled 

wi th J ^P by treatment with polynucleotide kinase in the 
32 

presence of P-ATP, using standard conditions for the 
10 reaction. 

DNA sequences, including those isolated from 
DNA libraries, may be modified by known techniques, 
including, for example site directed mutagenesis, as 
described by Zoller (1982). Briefly, the DNA to be 

15 modified is packaged into phage as a single stranded 

sequence, and converted to a double stranded DNA with DNA 
polymerase using, as a primer, a synthetic 
oligonucleotide complementary to the portion of the DNA 
to be modified, and having the desired modification 

20 included in its own sequence. The resulting double 

stranded DNA is transformed into a phage supporting host 
bacterium. Cultures of the transformed bacteria, which 
contain replications of each strand of the phage, are 
plated in agar to obtain plaques. Theoretically, 50% of 

IS the new plaques contain phage having the mutated 
sequence, and the remaining 50% have the original 
sequence. Replicates of the plaques are hybridized to 
labeled synthetic probe at temperatures and conditions 
which permit hybridization with the correct strand, but 

\0 not with the unmodified sequence. The sequences which 
have been identified by hybridization are recovered and 
cloned. 

DNA libraries may be probed using the procedure 
of Grunstein and Hogness (1975) . Briefly, in this 



BM8DOCID: <WO_0213671A1JA> 



WO 92/13871 




PCT/US92/00863 



-60- 



procedure, the DNA to be probed is immobilized on nitro- 
cellulose filters, denatured, and prehybridized with a 
buffer containing 0-50% formamide, 0.75 M NaCI, 75 mM Na 
citrate, 0,02% (wt/v) each of bovine serum albumin, poly- 
5 vinyl pyrollidone, and Ficoll, 50 mM Na Phosphate (pH 
6.5), 0.1% SDS, and 100 micrograms/ml carrier denatured 
DNA. The percentage of formamide in the buffer, as well 
as the time and temperature conditions of the 
prehybridization and subsequent hybridization steps 
10 depends on the stringency required. Oligomeric probes 
which require lower stringency conditions are generally 
used with low percentages of formamide, lower 
temperatures, and longer hybridization times. Probes 
containing more than 30 or 40 nucleotides such as those 
15 derived from cDNA or genomic sequences generally employ 
higher temperatures, e.g., about 40-42°C, and a high 
percentage, e.g., 50%, formamide. Following 
prehybridization, 5' - 32 P-labeled oligonucleotide probe is 
added to the buffer, and the filters are incubated in 
20 this mixture under hybridization conditions. After 
washing, the treated filters are subjected to 
autoradiography to show the location of the hybridized 
probe; DNA in corresponding locations on the original 
agar plates is used as the source of the desired DNA. 
25 For routine vector constructions, ligation 

mixtures are transformed into E. coli strain HB101 or 
other suitable hoBt, and successful transf ormants 
selected by antibiotic resistance or other markers. 
Plasmids from the transf ormants are then prepared 
30 according to the method of Clewell et al . (1969), usually 
following chloramphenicol amplif ication (Clewell (1972)). 
The DNA is isolated and analyzed, usually by restriction 
enzyme analysis and/or sequencing. Sequencing may be by 
the dideoxy method of Sanger et al. (1977) as further 
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described by Messing et al . (1981), or by the method of 
Maxam et al . (1980). Problems with band compression, 
which are sometimes observed in GC rich regions, were 
overcome by use of T-deazoguanosine according to Barr et 
5 al. (1986) . 

An enzyme- linked immunosorbent assay (ELISA) 
can be used to measure either antigen or antibody 
concentrations. This method depends upon conjugation of 
an enzyme to either an antigen or an antibody, and uses 

0 the bound enzyme activity as a quantitative label. To 
measure antibody, the known antigen is fixed to a solid 
phase (e.g., a microplate or plastic cup), incubated with 
test serum dilutions, washed, incubated with 
anti- immunoglobulin labeled with an enzyme, and washed 

5 again. Enzymes suitable for labeling are known in the 
art, and include, for example, horseradish peroxidase. 
Enzyme activity bound to the solid phase is measured by 
adding the specific substrate, and determining product 
formation or substrate utilization colorimetrically . The 

3 enzyme activity bound is a direct function of the amount 
of antibody bound. 

To measure antigen, a known specific antibody 
is fixed to the solid phase, the test material containing 
antigen is added, after an incubation the solid phase is 

5 washed, and a second enzyme -labeled antibody is added. 

After washing, substrate is added, and enzyme activity is 
estimated colorimetrically, and related to antigen 
concentration . 

Examples 

» Described below are examples of the present 

invention which are provided only for illustrative 
purposes, and not to limit the scope of the present 
invention. In light of the present disclosure, numerous 
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embodiments within the scope of the claims will be appar- 
ent to those of ordinary skill in the art. 

Example l 

5 Isolation of Pi 1 r 

An outer membrane preparation from 
N. gonorrhoeae strain MSll ms (P + ) contains small amounts 
of a 110 kd protein, Pile. This protein was enriched 
during alternate cycles of crystallization and 
10 solubilization of pili, unlike other outer membrane 

proteins that decreased in abundance by this procedure. 

The materials and methods used for the 
isolation procedure were the following. 
Bacterial Strains and growth rn n ditions 

N. gonorrhoeae MSll^ (Meyer et al., 1984) and 
P" and P"n variants of MSll^ (Swanson et al., 1986) were 
kindly obtained from Dr. M. So and from Dr. M. Koorney, 
respectively. The gonococcal isolates UM01 and KH4318 
have previously been described (Norlander et al . , 1981). 
N. gonorrhoeae strains 605344 and 605103 were obtained 
from Dr. D. Danielsson, Orebro, Sweden, and strain 765 
was isolated at the Department of Bacteriology in Umea, 
Sweden. The commensal Neisseria species N. lactamica 
Nctc 10618 and N. subflava GN01 were obtained from 
25 Pharmacia, Uppsala, Sweden. These bacteria were grown at 
37°C in a 5% C0 2 atmosphere on Difco GCB agar containing 
Kellogg's supplement. Piliated (P + ) and nonpiliated (P") 
variants were distinguished by colony morphology and 
passed as single colonies. E. coli strain Y 1090 
3 0 (obtained from Promega Biotech) was used for plaque 
screening, DH5 (Hanahan, 1985) for molecular cloning, 
AA10 recA (Stoker et al. 1984) for isolation of minicells 
and TGI (Gill et al., 1986) for propagation of M13 
clones . 
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Preparation Qf Pili and outer membranea 

Pili were prepared essentially as described by 
Brinton et al . (1978). Gonococci (P + Tr) from 80 GGB 
5 plates, grown for 18 h, were harvested in 0.05 M Tris- 
HC1 pH 8.0 and 0.15 M NaCl, washed twice and resuspended 
in 40 ml 0.15 M ethanolamine pH 10.5. Pili were sheared 
off in a Sorvall Omnimixer, setting 3 for 30 s. The cell 
debris was pelleted at 13,000 g for 30 min at 4°C and the 

10 supernatant was dialyzed against 0.05 M Tris-HCl pH 8.0 
and 0.15 M NaCl . The crystallized pili were pelleted at 
13,000 g for 60 min., resuspended in 0.15 M ethanolamine 
pH 10.5, and centrifuged at 23,000 g for 60 min. The 
supernatant was dialyzed as described above against 0.05 

15 M Tris-HCl pH 8 . 0 and 0.15 M NaCl. Several cycles of 
crystallization and solubilization were performed to 
produce pili preparations with high purity. Outer 
membranes of N. gonorrhoeae were prepared by the sarkosyl 
method described by Norquist et al. (1978) . 

20 

Example 2 

Preparation of Purified Ant i -Pile Antibodi^ 

The 110 kd protein present in purified MSll^ 
pili preparations was eluted from SDS polyacrylamide gels 
25 and rabbit antibodies were generated against the gel 
purified protein. The antiserum cross reacted 
extensively with the pilin protein in immunoblots and was 
therefore absorbed with extracts of Pseudomonas putida 
expressing the pilin subunit of N. gonorrhoeae on plasmid 
30 pGC02. 

Pili preparations of N. gonorrhoeae MSll^fP" 1 ") 
crystallized 5 times were separated on 10% SDS -poly- 
acrylamide gels using the buffer system of Laemmli 
(1970). These gels were stained in 0.25 M KC1 and 1 mM 
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DTT for 5 min., the 110 kd protein band was sliced out, 
crushed and incubated in a buffer containing 0.05 M Tris- 
HC1 pH 7.9, 0.1 mM EDTA, 5 mM DTT and 0.15 M NaCl at 4°C 
overnight. Gel pieces were removed by centrif ugation 
5 prior to immunization of rabbits. 

The achieved 110 kd- antiserum was extensively 
absorbed with Pseudomonas putida 2440 (Bagdasarien et al . 
(1983) , carrying a recombinant plasmid, pGC02, 
constructed as follows. The 1.0 kb Hpal-EcoRl fragment 

10 of the pilus gene clone pNGHOO (Meyer et al. 1984) 

obtained from M. So was cloned into the Hpal and EcoRI 
sites of pMMB66 (FQrste et al., 1986). The pilE gene is 
then under control of the tac promoter and induction with 
1 mM IPTO resulted in high levels of pilin produced in P. 

15 putida 2440, but no extracellular pili structures were 
observed. Dense sonicated cultures of P. putida 
2440/pGC02 were mixed in a 1:1 ratio with the crude 
antiserum. About 15 cycles of 1 h incubation and 3 0 min 
centrif ugation at 25,000 g in the presence of l mM PMSF 

20 (phenylmethylsulf onylf luoride) at 4°C were performed. 

The pili antiserum used in immunoblots was 
generated in a rabbit against highly purified pili 
preparations of N. gonorrhoeae MSll^. 

In immunoblots 10 fig of boiled bacterial cells 

25 or the same amount of outer membranes were 

electrophoresed on 10% SDS-polyacrylamide gels. The 
proteins were transferred from the gel onto 
nitrocellulose sheets where their immunological cross - 
reaction with the 110 kd absorbed antiserum was tested 

3 0 using an immunoblotting protocol as described by Towbin 
et al . (1979) . 
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Specificity of P urified Anti-PilC Antibodies 

The absorbed antiserum was used in immunoblots 
with whole cell extracts of a number of N. gonorrhoeae 
5 strains as well as commensal strains of Neisseria (Figure 
2A) . All strains of N* gonorrhoeae, except strain 
605103, contained one or two high molecular weight 
protein species reacting with the antiserum. Strain 
605103, unlike the other strains tested, was nonpiliated 

10 and no piliated variants could be obtained suggesting 
that it is a P"n variant (Swanson et al., 1985). This 
was confirmed by Southern blot hybridization using an 
oligonucleotide probe corresponding to the 5' end of the 
pilE gene. No hybridization was obtained with this 

15 probe. The commensal N. lactamica Nctcl0618, but not N. 
subflava GN01, contained a high molecular weight protein 
reacting with the 110 kd antiserum. Immunoblots against 
outer membrane preparations of P + and P'n MSll^ showed 
the 110 kd protein to be present in the outer membrane in 

20 both of these MS11 variants. 

Southern blot hybridization was accomplished as 
follows. Digested genomic DNA was separated on 0.7% 
agarose gels and transferred to nitrocellulose filters 
(Southern, 1975) . After transfer and baking the filters 

25 were prehybridized in a mixture of 5 x SSC, 0.1% SDS, 
5 mM EDTA, 5 x Denhardt's solution and 100 /xg/ml of 
sonicated calf thymus DNA at 65°C for 2-6 h. 32 P-labeled 
probe (multiprime DNA labelling system, Amersham 
International) was added and hybridization was performed 

3 0 for 12-15 h at the same temperature. The filters were 
washed in 2 x SSC with 0.1% SDS and in 0.2 x SSC with 
0.1% SDS for 2 x 15 min each, dried and exposed to Kodak 
XRP film at -80°C. 
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A 21 -base- long oligonucleotide complementary to 
the signal peptide coding region of pilE (5'- 
GCCTTTTTGAAGGGTATTCAT - 3 ' ) was 32 P-labeled with T4 
polynucleotide kinase and used to probe Clal -digested 
genomic DNA. The blot was prehybridized at 37 «C in a 
mixture containing 2 x Denhardt's, 0.1% SDS, 2.5 mM EDTA, 
5 x SSC and 100 fig/val sonicated calf thymus DNA, 
hybridized at 37°C and washed in 2 x SSC for 5 min. 
MS11 mk (p * ) 9 ave a 4 kb hybridization fragment, whereas 
MS11 mk (p " n) and 605103 gave no hybridization signal. 



Example i 

MQleCUlar rlonina of the niln aono 
encod-inor fl xio icd prfttftin 
Chromosomal DNA from N. gonorrhoeae MSll mJc (P + ) 
was used to construct a Xgtll library. The library was 
screened with the absorbed 110 kd antiserum and one 
positive clone out of 10,000 plaques was found, 
containing an 800 bp insert. A lysogen of this positive 
Xgtll clone was examined in immunoblots and a fusion 
protein with an estimated size of 150 kd reacted with the 
antiserum (data not shown) . The 800 bp insert was 
purified, labeled with 32 P, and used as a probe to screen 
a plasmid library from N. gonorrhoeae MSll^. six clones 
25 out of 10,000 hybridized with the probe. Restriction 

maps for these partially overlapping six clones are shown 
in Figure 1. 

In Figure 1, plasmids pABJ04-09, which all 
belong to locus 1, were isolated from a plasmid library 
using the 800 bp insert from Xgtll as a probe. The Xgtll 
insert (from locus 2) has an additional Sail site not 
found in the plasmid clones. The position of the piici 
gene and direction of its transcription (indicated by an 
arrow) were determined in E. coli minicells. Three 
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thickened lines, with an arrow-head at each end, indicate 
fragments used as probes in Southern hybridizations. 
I.e., the 800 bp insert from Xgtll, the £coRV 1 - EcoRV 2 
(1.3kb) and the EcoRV 2 -HindIII 4 (0.8 kb) fragments of 
pAGJ04. Triangles mark the location of two mTnCm 
insertions in pABJ04 . The resulting plasmids, 
pABJ) 4 : :mTnCm-12 and pABJ04 : :mTnCm- 14 were used, were 
used to inactivate pllCl and pilC2. 

The six plasmid clones, pABJ04-09, were 
transformed into the minicell producing strain AA10 to 
monitor expression of plasmid encoded [ 3 5 S] methionine 
labeled proteins. The E. coll minicell strain AA10 was 
transformed with plasmid DNA (pABJ04-09) and chromosome 
deficient minicells from these strains were purified over 
15 sucrose gradients (Thompson and Achtman, 1978) . The 

plasmid- encoded proteins were labeled in the presence of 
80 /xCi [ 3 5 S] methionine in minimal salts medium and 1% 
methionine assay medium (Dif co) . After lysis of the 
minicells in sample buffer (Laemmli, 1970) the proteins 
were electrophoresed on an SDS-polyacrylamide gel, the 
gel was dried and exposed to X-ray film (Kodak X-OmatAR) . 

Plasmid pABJ04 expressed minute amounts of 
three high molecular weight proteins, 113, ill and 108 kd 
in size, as well as a number of lower molecular weight 
25 protein species not produced from the vector control. 
The three high molecular weight bands were missing in 
pABJ05 and pABJ06 but three novel lower molecular weight 
protein species had appeared, suggesting that pABJ05 and 
PABJ06 are deleted for the 3' end of a gene, denoted 
pilCl, and that this gene is responsible for all three 
high molecular weight species. This suggested that the 
distal end of the gene must be located between the Mlul 1 
and Muli 2 sites (Figure 1) . The observation that plasmid 
PABJ07 did not express any high molecular proteins 
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tentatively located the 5' end of the gene to a region 
0.5-1.2 kb to the right of the EcoRV 3 site. The size for 
a gene encoding a 110 kd protein is ~3 kb which is in 
agreement with these mapping data. 



Example 5 

Identification of a Second Gene Encoding Pile 

The 800 bp insert in Xgtll contains a single 
Sail site not present in the region on pABJ04 which 
10 hybridized to this fragment, suggesting that there is 

more than one pilC locus in the genome of N. gonorrhoeae 
MS11. This was confirmed in Southern blot hybridizations 
in which three different pilC fragments were used to 
probe Smal and Clal digested genomic DNA. The 800 bp 
15 fragment from Xgtll hybridized in a Southern blot to two 
Clal (18 and 8 kb) and Smal (13 and 4.5 kb) fragments of 
DNA prepared from N. gonorrhoeae MSll^. Since the probe 
does not contain any internal Clal or Smal sites, there 
are presumably two copies of the 3 ' end of pilC in the 
20 MS 11 genome. The 1.3 kb EcoRV^ - EcoRV 2 fragment of 

pABJ04 carries the central region of pilCl. This probe 
hybridized to the same two Clal fragments and to four 
Smal fragments, two of which are the same size as the two 
Smal fragments identified with the 800 bp probe (13 kb 
25 and 4.5 kb) . Hybridization with the 800 bp probe was 
more extensive to the 8 kb Clal and the 4 kb Smal 
fragment whereas the reverse was found with the 1.3 kb 
EcoKV 1 - EcoRV 2 fragment from pABJ04 strongly suggesting 
that the two genomic copies of pilC show a significant 
30 sequence variation in the 3' as well as in the central 

region. A probe corresponding to the 5' region of pilCl 
was also used in Southern hybridization experiments. 
This 0.8 kb HindIII 4 - EcoRV 3 fragment hybridized to two 
Clal (18 kb and 4 kb) and Smal (25 kb and 7 kb) fragments 
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with seemingly equal efficiency. The hybridization 

pattern was identical using DNA from MSII__. Taken 

ins 

together these hybridization data indicate that 
N, gonorrhoeae MSll contains two complete copies of pile. 
5 Furthermore the two genes appear to be more homologous in 
their 5' as compared to their central and 3' regions. 

The results indicate that the 800 bp insert 
from Xgtll carries information from pilC2 whereas the 
clones pABJ04-09 must carry information from pilCl. 

10 Finally pxlC2 must be located >2 kb from either end of 
pilCl. The DNA sequence of the 3 '-end of the pilC2 
fragment is shown in Figure 7. The sequence showing the 
putative amino acids encoded therein are shown in Figure 
8. A comparison of the analogous portions of pllC2 (top) 

15 and pilCl (bottom) DNA sequences, and the putative amino 
acids encoded therein are shown in Figure 9, 

The 800 bp fragment from pilC2 was also used to 
probe digested genomic DNA from N. gonorrhoeae strains 
UM01, 765 and 605103. The latter isolate does not 

20 express detectable levels of the 110 Jed protein. Strain 
UM01, unlike MSll, contained only one Clal fragment of 15 
k±> that hybridized to the probe (data not shown) . Hence, 
this strain may contain only one copy of pile. Strain 
605103 and 765, on the other hand, each seem to contain 

25 two copies of pile since two Clal and two Smal fragments 
hybridized to the 800 bp probe. 

The commensal N. lactamica Nctcl0618 DNA 
digested with Clal and Smal also hybridized with the 800 
bp probe. Since only one band hybridized in each case 

3 0 this strain may contain only one copy of pilC. In 

contrast, N. subflava GN01 did not hybridize to the 800 
bp pilC2 probe using the same stringency. 
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Characterization of the pile Genes 

The pilCl gene on PABJ04 is translationallv out of frame 
5 The amino terminal sequence of gel purified 110 

kd protein from strain MSll^ (P + ) was determined by 
sequential Edman degradation. For aminoterminal sequence 
determination automated Edman degradations (Edman and 
Bregg, 1967) were performed in an updated Beckman 890C 

10 spinning cup sequencing sequencer. The sequencing 

procedure and the method for analysis of the 3 -phenyl -2- 
thiohydantoin derivatives been described (Engstrfim et 
al., 1984). Considerable difficulties were encountered 
in the method probably due to blocking of the N- terminus. 

15 As a result, only the residues from position 4 to 10 were 
obtained (Figure 2) . 

The 3.3 kb HindIII 4 - Mlul x fragment 
encompassing the entire pilCl gene was sequenced on both 
strands using the dideoxy sequencing method adapted for 

20 single stranded DNA. 

Purified DNA fragments from pABJ04 and PCR- 
amplified 5' end of pilCl was subcloned into M13 vectors 
(Sanger et al., 1980: Yanish-Perroa et al., 1985) and 
sequenced using the chain termination method of Sanger et 

25 al. (1977). Primers used were the M13 17-mer universal 
primer and oligonucleotides synthesized at Symbicon, 
Umed, Sweden or at the Department of Biochemistry, 
Washington University, St., Louis, MO, USA. 

The results of the sequencing showed that the 

3 0 pilCl contained one single open reading frame of 997 

codons (from left to right in Figure 2) and starting at 
an AUG codon 195 bp from the Hindi I X 4 site. Codons 7-12 
in this open reading frame corresponded to amino acids 5- 
10 in the sequence of the gel purified protein. The AUG 
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codon in the beginning of the long open reading frame was 
not preceded by a typical Shine -Dalgarno sequence. 
Moreover, since the 110 kd protein is located in the 
outer membrane of N. gonorrhoeae, we expected the protein 
5 to be translated with a signal sequence. When examining 
the nucleotide sequence, an AUG codon was found in frame 
1 that was preceded by a typical Shine -Dalgarno sequence 
(-AGGAA-). The sequence following this AUG codon would 
encode a typical signal peptide with basic amino acids in 

10 the amino terminal region and a hydrophobic central 

region. However, no signal peptidase cleavage site could 
be predicted following the rules of von Heijne (1983) . A 
tract of 12G residues was found in the region encoding 
the putative signal peptide for PilC. Addition of one G 

15 residue or the loss of two would align the long open 
reading frame with the AUG codon in frame l. The 
translated region in frame 2 contains a putative signal 
peptidase cleavage site between Ala and Gin. A cleavage 
at this site would align the determined amino acid 

20 sequence at positions 5-10 for the 110 kd protein with 
the deduced amino acid sequence. The data therefore 
suggested that the cloned pilCl gene is out of frame due 
to f rameshif ting in the region encoding the signal 
peptide. 

25 Figure 2 shows the nucleotide sequence and the 

deduced amino acid sequence of the 5' -end of pile. The 
amino- terminal sequence of gel purified PilC from 
MS11 ms (p+) is show n in a box below frame 2, a 997 amino 
acid long open reading frame that would code for a 

30 protein about 110 kd in size. Frame 1 contains 41 amino 
acids and is preceded by a putative Shine -Dalgarno 
sequence (underlined) . Two horizontal lines mark a 
stretch of 12 G residues. An addition of one G in this 
region would align the ATG (boxed) in frame 1 with frame 
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2 . Numbers above the sequence show base positions 
relative to the HindIII 4 site (-0) located on pABJ04 . 
The position of two 24 bp oligonucleotide primers 
(opposite stands) used for PCR amplif ication, are 
5 indicated above the sequence by hatched bars. 

Figure 3 shows the nucleotide sequence of the 
sense strand of the pilCl gene. 

Figure 4 shows the shows the nucleotide 
sequence of the sense strand of the pilCl gene and the 
10 amino acids encoded therein . 

genetic inactivation of oiiC2 but not piici 
frfrolisiies expression of the no kd nr^ e in in msh 

Plasmid pABJ04 was mutagenized in E. coli by a 
15 transposon mini-Tn3 derivative, mTnCm. The shuttle 

mutagenesis system developed by Seifert et al., (1986) 
using a miniTn3 carrying the chloramphenicol resistance 
gene was kindly provided by Dr. M. So. Mutagenesis of 
pABJ04 with mTnCm and transformation of N. gonorrhoeae 
were performed as previously described (Seifert et al., 
1990) . MiniTnCm insertions at 30 different positions in 
pABJ04 were identified, two of which mapped within the 
PilC gene. Piliated N. gonorrhoeae MSll^ were 
transformed with 2 /xg plasmid DNA, transf ormants were 
25 selected for on plates containing 10 fig/ml 

chloramphenicol for the single mutants and 3 0 /zg/ml 
chloramphenicol for the double mutants. 

Only two mTnCm insertions had occurred in 
pilCl (Figure 1) . Truncated protein species were seen in 
minicells with the mTnCm-14 insertion located 0.5 kb from 
the 3' end of pilCl but not with the mTnCm-12 insertion 
located 0.5 kb from the 5' end of the gene. Both 
insertion mutants were used in a gene replacement 
^ experiment. Plasmids pABJ04 : :mTnCm- 12 and pABJ04 : rmTnCm- 
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14 were linearized with BamHI and transformed into 
N. gonorrhoeae MSll mk (p + ) and transf ormants resistant to 
10 fig/ml of chloramphenicol were selected. Forty- eight 
P + transf ormants (24 from each experiment) were assayed 
for the presence of Pile in immunoblots . All these 
transf ormants remained capable of expressing the Pile 
protein. Genomic DNA was prepared from seven of the 
chloramphenicol resistant transf ormants (five from 
pABJ04: :mTnCm-12 and two from pABJ04: :mTnCm-l4) , cleaved 
with Clal and PvuII and used in Southern blot experiments 
using the EcoRV 1 - EcoRV 2 fragment of pABJ04 as a probe. 
The 8 kb Clal fragment was unaffected in the mutants 
whereas the 18 kb Clal fragment had been replaced by a 20 
kb fragment. PvuII cleaves within the 1.6 kb mTnCm 
element. The probe detected an 8 kb PvuII fragment in 
both parent and mutant DNA. In the mutants, a novel 
PvuII fragment appeared that was 6.2 kb in size in five 
transf ormants obtained with pABJ04 : :mTnCm-12 and 4 . 8 kb 
in size in two transf ormants with pABJ04 : rmTnCm- 14 . To 
confirm the insertion of mTnCm, a 250 bp EcoRI-Hindlll 
fragment of the CAT GenBlock (Pharmacia, Sweden) , 
containing the PvuII site, was used as a probe. It 
detected the larger of the two Clal fragments as well as 
the 6.2 kb PvuII fragment. In addition, a 2 kb PvuII 
fragment not covered with the pile probe was detected. 
These data demonstrate that we have obtained gene 
replacements in pilCl, whereas pilC2 was unaffected in 
all seven P + , Pilc + transf ormants . A rapid hybridization 
was done to screen the remaining 41 P + transf ormants . 
All but one had mTnCm inserted in pllCl. The remaining 
trans formant had an intact locus 1 and 2 and must 
therefore contain mTnCm elsewhere in the gonococcal 
chromosome. 
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in the same transformation experiments, the 
frequency of P" colony variants was about five- fold 
higher as compared with that occurring normally in strain 
MS11 mk (p+) • Two P'mTnCm-12 transf ormants isolated at 10 
5 /tg/ml of chloramphenicol were also analyzed by Southern 
blot hybridization using the BcoRV^^ - EcoRv 2 fragment of 
PABJ04 and the EcoRI - Hindlll fragment of the CAT 
GenBlock. Each of these mutants carried mTnCm in pilC2 
as evidenced by a replacement of the 8 kb Clal fragment 
10 by a fragment 9.5 kb in size that hybridizes to both 
probes. These pilC2 : :mTnCm insertion mutants did not 
express Pile as determined by immunoblot analysis. 

A p+ . piltt: :mTnCm-i2 mutant was retransf ormed 
with DNA prepared from a P~, pilC2: :mTnCm- 12 mutant and 
15 colonies growing at 30 fig/ml of chloramphenicol were 

selected to obtain double mutants in pile. All resistant 
transf ormants were P" , and when analyzed by Southern blot 
hybridization all contained mTnCm in both pilCl and 
pilC2. Electron microscopy revealed that the P + , 
20 pilCl: rmTnCm- 12 mutant still expressed pili albeit at a 
slightly lower level than the MSll^fP*) parental clone, 
whereas the P", pilC2: :mTnCm- 12 was completely bald as 
was the pilCl, pilC2 double mutant. 

Immunoblot analyses were performed on the P + 
25 pilCli :mTnCm- 12 mutant, the P", pzlC2: :mTnCm- 12 mutant 
and the P" , pilClz rmTnCm- 12, pxlC2: :mTnCm- 12 double 
mutant, using Pile and pili antisera. Inactivation of 
pilCl did not abolish expression of Pile or the pilin. 
Inactivation of pilC2 totally abolished expression of 
PilC but did not affect expression of pilin. The pilCl, 
pilC2 double mutant was PilC" but produced only low 
levels of pilin. Taken together these data imply that 
pxlC2 but not pilCl is expressing Pile in the MS11 
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variant under study. Moreover, inactivation of pilC2 but 
not pilCl was associated with a loss of piliation. 

P + revertants occurred spontaneously at a low 
frequency in the pilC2: :mTnCm-12 mutants. These 
5 revertants expressed pili as determined by electron 

microscopy and also expressed Pile. It is likely that 
PilC expression is due to in- frame switching in pilCl. 

Figure 5 shows the nucleotide sequence of the 
sense strand of the pilCl gene, and the effect of frame 
10 shift on the putative gene products encoded therein. 

The nllC genes of N. gonorrhoeae vary 

in the length of the G tract 
Polymerase chain reaction (PCR) with Tag 

15 polymerase was used to analyze the 5' region of pile 

using two 24 base long synthetic oligonucleotides based 
on the sequence of pilCl (Figure 2) . These 
oligonucleotides would generate an amplified fragment of 
149 bases as judged from the sequence obtained from 

2 0 pABJ04. 

Polymerase chain reaction was carried out in 
100 ill containing 50 ng of genomic DNA or 5 ng of plasmid 
DNA. l.O /iM of each oligonucleotide, 200 /xM of each 
nucleotide, 0.001% gelatin, 1.5 mM MgCl 2 , 10 mM Tris (pH 
25 8.3), 50 mM KC1, 0.25 /il 1 mCi/ml [ 32 P]dATP and 2 U of 
Taq Polymerase (Perkin Elmer Cetus) . The samples were 
passed through 25 cycles: 2 min at 50°C, 1 min at 94 °C 
and 3 min at 72 °C in a Thermal Cycler (Perkin Elmer 
Cetus) . Aliquots of the DNA fragments were denatured at 

3 0 95 °C for 2 min and electrophoresed on standard denaturing 

sequencing gels. 

The amplified products from MSll^fP*) DNA were 
149 and 150 long respectively. In addition, two less 
abundant products of 151 and 148 bases were seen. The 
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amplified products were electroeluted and cloned into 
Ml3mpi8, and twenty phage clones were sequenced using a 
universal primer. Four different sequences were obtained 
(Figure 6) . 

5 Figure 6 shows the nucleotide sequence of PGR 

amplified fragments demonstrating a variation in the 
length of the G tract and sequence differences in the 5'- 
region of the pile genes. The two oligonucleotide 
primers used for the PCR are shown in Figure 2. 
10 Amplified DNA was cloned into M13mp8 and sequenced. 

Shown are the complete nucleotide sequence in between the 
two primers* In- frame sequences are translated and the G 
stretches are underlined. The putative cleavage sites 
are marked with arrows. Genomic DNA from N. gonorrheae 
15 strains MS11 (P + . PilC + ) , UM01 (P + . PilC + ) , 765 (P + .PilC + ) 
and 605103 (P'n.PilC"), and purified DNA from 
PABJ04/AA10 (recA) was used in the PCR. 

Variant patterns la and lb were identical to 
each other and to the cloned sequence on pABJ04 except 
20 for the presence of 11 instead of 12 G residues in the G 
tract of lb. The G tract of sequence 2a was 13 residues 
long indicating that the sequence is in frame. In 
addition, this sequence differed from pilCl by four 
basepair substitutions outside the G tract, including an 
25 AAA lysine codon four triplets downstream of the putative 
signal peptide processing site which is in agreement with 
the lysine residue found in the fourth position of the 
gel purified 110 Jed Pile protein. Sequences la and lb 
contained CAA, the codon for Gin, at the same position. 
>0 Sequence 2b was identical to 2a except for the presence 
of 12 G residues in the G tract. These data are 
compatible with sequence l being from pilCl and sequence 
2 from pilC2 and further support that pilC2 must be the 
expressed locus in the MS11(P + ) variant we are studying. 
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Strain UM01 apparently only contains one copy 
of pile. DNA from this strain generated five amplified 
fragments ranging in size from 14 8 to 152 bp in the PCR 
reaction. The most abundant fragments were 149-151 bp 
long. Among ten M13 clones, three variant 1 sequences 
were found (a,b,c) that differed only in the number of G 
residues (11-13) in the G tract (Figure 6) supporting the 
hybridization data that this strain contains only one 
pilC gene. Since a Pile protein is expressed from UM01 
we suggest that the majority of cells has 13 Gs in the G 
tract. 

Strain 765 contains two pilC loci, both of 
which seem to be translationally ON based on the presence 
of two high molecular weight proteins reacting with the 
absorbed PilC antiserum. A number of amplified fragments 
were seen after the PCR reaction ranging in size from 149 
to 153 bases. Three variant sequences were found among 
nine clones (Figure 6) . The G tract was 13 residues long 
in variant 3a (in frame) and 14 (out of frame) in variant 
3b whereas sequence variant 4 contained 11 G residues in 
the G tract. Variant sequence 4 contained four 
additional nucleotides (-CAGG-) distal to the G tract 
relative to variant sequences 1, 2 and 3, indicating that 
the amplified product with 11 Gs from this variant 
sequence is 152 long and out of frame. Two PCR amplified 
products 152 and 153 in length were obtained from strain 
765 suggesting that in frame variants of sequence 4 might 
be present in the DNA prepared from this strain. 

Strain 605103 carries two pilC copies, both of 
which seem to be translationally OFF. The amplified 
fragments were 148 and 149 bases in size. Out of eight 
M13 clones only variant la and lb sequences were found, 
with 11 and 12 Gs in the G tract respectively. 
Consequently, we were unable to find an in frame sequence 
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variant from this strain. We do not know if the 5' ends 
of the two pile genes are identical in this strain or if 
one pile gene differed from pilCl in the region 
corresponding to the oligonucleotides used for 
amplif ication. In the latter case we would not expect to 
obtain any amplified products from the second copy. 

The only in frame variant found in DNA 
amplified from N. gonorrhoeae carried 13 Gs in the G 
tract. To see if variants with lOGs arise in products 
expressed in E. coli, PCR amplified products were 
generated from pABJ04 purified from E. coli strain AA10, 
using the same two oligonucleotide primers as before. 
Out of 12 sequenced clones, two carried 10 Gs in the G 
tract (Figure 6) . The majority of clones (seven) carried 
12 Gs as expected. It is likely that the PCR 
amplification products are not representative of the 
original DNA population. However, the distribution of 
variation in the G tract is consistent with a model in 
which only one G residue is gained or lost at one given 
event. Since AA10 is recA, frameshift mutations in the G 
tract in E. coli occur independently of the RecA protein. 

N. meningitidis contains two pile loci 
Southern blot hybridizations using MSll pilCl" specific 
probes identified multiple fragments when meningococcal 
genomic DNA is digested with a variety of restriction 
endonucleases . PCR amplification using two 24-base 
oligonucleotides from the 5' end of MSll pilCl as primers 
yields multiple fragments ranging in size from 148 to 151 
bases. DNA sequencing of fragments cloned into phage Ml 3 
identifies two classes of sequences, as in N. 
gonorrhoeae, which differ outside the G- tract. Variation 
occurred within each class with respect to the number of 
G's in the G- tract. Therefore, N. meningitidis must 
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carry two pile loci, as does N. gonorrhoeae, which should 
also be under the control of translational frame 
shifting. 

Both pile loci are cloned from N. meningitidis 
5 by generating an EMBL3 library and screening this library 
with pilCl' and pilC2' specific DNA from N. gonorrhoeae 
strain MSll. 

Genomic DNA from N. meningitidis is partially 
digested with Sau3A and fragments ranging from 9 to 20 JcB 
10 are ligated into the lambda EMBL3 vector. Because of the 
packaging constraints of the phage, only those lambdas 
which contain DNA fragments of this size will be packaged 
(i.e., are viable). The library thus constructed can be 
screened with genomic oligonucleotide or cloned gene 
15 probes following selection in a lysogenic E. coli strain. 
(Frischart, A.M. et al (1983), J. Mol. Biol. 170:827). 
Preferably, full length clones are identified by 
screening for clones hybridizing to both the 5' and 3' 
ends of pile. If full length clones cannot be obtained 
from the EMBL3 library, pile specific probes may be used 
to screen a plasmid library from the same strains. 

Translational fusion proteins with &- 
galactosidase may also be screened for in a Xgtll 
library, using 0-galactosidase and PilC* specific 
25 antisera in Western immunoblots. 0- galactosidase -PilCl 
and ^-galactosidase- PilC2 fusion proteins are purified 
from the cytoplasm of recombinant E. coli and used to 
raise specific antisera. 

30 Example 7 

The imrpvnQhiolQqical propertie s of PilC 
PilC is located in the outer membrane of 
Neisseria. The immune response during natural infection 
^ can be assessed by screening convalescent sera for anti- 
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PilC antibody. The presence of only two pile loci 
suggests that Pile is only moderately variable, however. 
This together with its essential role in pilus biogenesis 
makes Pile attractive as a potential vaccine candidate. 

Two types of Pile translational fusions using 
alkaline phosphatase and 0-galactosidase are generated. 
In the first instance a secreted fusion protein is 
obtained that may associate with the outer membrane. In 
the second instance the fusion proteins may accumulate in 
the cytoplasm as inclusion bodies. The construction 
schemes for such fusion proteins uses techniques known in 
the art. TnphoA insertions on plasmid pABJ04 in E. coli 
are generated, and a PhoA + phenotype is screened for as 
blue colonies on media containing the chromogenic 
substrate XP. if such clones have the phoA gene in frame 
with an in frame variant of pilCl the fusion product 
should be able to cross the cytoplasmic membrane where it 
can be analyzed by Western immunoblots using an alkaline 
phosphatase specific antiserum and our Pile antiserum 
raised against gel-purified PilC2 from MS11(P + ). 
LacZ: zpilC fusions are generated by cloning different 
segments of pile into a la.cZ' containing vector used to 
generate translational fusions. Similar constructs are 
performed on each of the two pile genes from N. 
meningritidis. Antisera are generated against fusion 
proteins after their purification using conventional 
protocols. These antisera are extensively adsorbed with 
extracts of E. coli expressing alkaline phosphatase and 
0-galactosidase, and used in Western immunoblots and 
ELISA assays against a panel of Neisseria gonorrhoeae and 
Neisseria meningi tidis strains. Antisera raised against 
fusion proteins carrying the major portion of Pile are 
also analyzed in Western blots using E. coli expressing 
fusion proteins containing only smaller regions of Pile. 
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The results of these studies should show which regions in 
PilC are immunodominant. 

The pilCl and pilC2 genes are highly homologous in their 
5' ends whereas the homology is considerably less 
pronounced in the central and 3' region. 

In addition, the entire pilC2 gene from N. 
gonorrhoeae MSli{p + ) is cloned and sequenced. Algorithms 
are used to search for potential T-cell epitopes 
{amphipathic helical conformation) and j8-cell epitopes. 
Polypeptides containing the predicted epitopes are tested 
to determine if they can prime mice for an enhanced 
immune response to PilCl and PilC2. 

Specific PilC antisera are used in 
immunoelectromicroscopy with piliated Neisseria cells as 
15 well as with purified pili to see if Pile is physically 
connected with the pilus fiber. 

Neisseria is grown in the presence of different 
dilutions of PilC" specific antibodies. Bactericidal 
effect exerted by the antiserum, effects on piliation, 
and effects on bacterial attachment to corneal primary 
culture cells are monitored. Binding assays to 
epithelial cells are described in Tjia, K.F. et al . 
(1988), Graefe's Arch. Clin. Exp. Opthalmol. 226:341- 
345. 
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Example fi 

Ident i fication and rhararf.Pri zation of fTpnpa 
located adiart»T^ pile 

The pilCl and pilC2 loci are part of a larger duplication 
that extends both 5'- and 3'- of pile. We know from our 
work with e. coli that strains may contain multiple gene 
clusters for the same class of pili. m one case we have 
shown that the only difference between two duplicated 
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gene clusters (pap and prs) resides in the adhesion genes 
such that each cluster gives rise to serologically 
identical pili binding to different cell surface 
receptors . 

mTnan mutagenesis in the region upstream and 
downstream of pilCl is performed to generate allelic 
replacements in the pilci and pilC2 regions on the 
chromosome. Since the two regions are highly homologous 
we expect to obtain for each insertion allelic 
replacements in either region. Double mutants are 
generated as before by isolating DNA from mutants 
carrying insertions in the p±lC2 region transforming P + 
variants carrying the same insertion in the pllCl region 
and select for transformants resistant to 30Mg/ml of 
chloramphenicol. These double mutants are examined for 
piliation, pilins expression, and binding to corneal 
primary culture cells. 



Example 9 

Phage variation in gonococcal pili expression 
can be CaUflftd bV f raiHPflhif 1- nrnl-ations in m7r 
If Pile is required for pilus formation, we 
would expect some P" progeny arising from a P + clone to 
accumulate unassembled pilin in the absence of Pile. 
Nonpiliated (P") colonies were derived from MSli^fp*) , 
restreaked, and tested for the presence of PilC and pilin 
in immunoblots with the Pile and pili antisera. Five out 
of eight P" clones did not produce detectable levels of 
PilC, but expressed the pilin subunit. The remaining 
three P" clones expressed Pile but not pilin. The 
molecular mass of the pilin subunit was the same in the 
P", PilC - variants as in MSll^fp- 1 -, PilC + ) . However, the 
former in addition produced a protein reacting with the 
pili antiserum that was 16 kd in size. Since MSll^ only 
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contains one expression site for pilin we believe that 
this protein species represents a proteolytic degradation 
product of the pilin and may be identical to the S -pilin 
previously described (Haas et al., 1987). Several 
independent P" clones were isolated from one P~, Pile" 
clone. They all remained PilC" and retained expression 
of pilin. Piliated (P + ) revertants were also obtained 
from the same P~, Pile" clone. These P + revertants 
occurred at about a tenfold lower frequency (10" 4 ) than 
P" derivatives from a P + clone. All P + revertants from a 
P", PilC" clone had regained expression of PilC. All but 
one expressed a pilin with the same molecular weight as 
the nonpiliated parent. However, the low molecular 
weight pilin degradation product was much less abundant 
in the P + , PilC" revertants. It was possible to obtain 
P + revertants from other P", PilC" clones as well, all of 
which expressed PilC. 

The pilE gene from one set of PilC switches was 
PCR amplified and sequenced directly. The P~, PilC", 
pilin* variant 8 carried eight amino acid changes in the 
pilin relative to the parental clone MSll^ The pilin 
sequence of the P + , PilC" backswitcher 8:1 was identical 
to variant 8. Thus, the backswitching from P"to P + 
colony morphology was not associated with any alteration 
in the pilus subunit protein implying that the change in 
colonial morphology was due to the switch in PilC 
expression. 

Strain MSll mk (P') # variants 8(P") and 8:1 (P + ) 
were also examined by transmission electron microscopy. 
Electron microscopy was performed with a JEOL 100CX 
microscope with 200 -mesh copper grids coated with thin 
films of 2% Formvar. The bacterial colonies were 
carefully overlaid with buffer [10 fiM Tris-HCl (pH 7.5), 
10 }M magnesium chloride] and the cells were allowed to 
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sediment for 15 min on a grid* The grids were washed 
with water, negatively stained with 1% sodium 
silicotungstate (pH 7.0) and then washed again. 

The electron micrographs showed that the 
MS11(P + ) parental cells were heavily piliated and pili 
were often seen to aggregate. In contrast most cells of 
variant 8(P") were nonpiliated. One or two pili were 
found on -10% of these cells. All cells of variant 8:1 
(P + ) were piliated, carrying -10-40 fibers/cells. No 
aggregation of individual fibers was seen. These data 
confirm that the observed changes in colonial morphology 
reflect alterations in expression of pili. Therefore, 
phase variation of gonococcal pili may not only be caused 
by recombination events occurring in the pilE locus 
(Bergstrom et al, 1986; Swanson et al ♦ , 1986) but also by 
frameshift mutations in pile. 

Example IQ 

Immunoaenicity of Pile 
In order to predict a region of Pile which 
would have a high probability for antigenicity, residues 
300 to 700 of the putative PilCl protein encoded within 
pilCl were analyzed for antigen index, hydrophilicity , 
and hydrophobicity using standard computer-modelling 
methods. The analysis indicated that the PilCl 
polypeptide fragment containing residues 300 to 700 would 
have several regions with a high antigen index, high 
hydrophilicity, and a high likelihood for location in an 
external domain. 

The immunogenicity of a recombinant polypeptide 
expressed from the DNA encoding amino acids 300 to 700 
was examined. The region of DNA encoding amino acid 
residues 3 00 to 700 was amplified by polymerase chain 
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reaction (PCR) using the following oligonucleotide 
primers . 

5' GGC TAG GTG GCA TAT GAA AGA TAC CGG 3' 
5 and 

5' TTT GCA ATC GG£ G&l CCT * C*A*G GTG TCT TTC 3' 

These primers incorporate an Ndel and a BamHI restriction 
endonuclease site (indicated by the underlined 

10 nucleotides) , respectively. A termination codon 

(indicated by the asterisks) was also incorporated. The 
PCR amplified DNA was then ligated into the vector pET3a 
(between the Ndel and BamHI sites) . The recombinant 
vector pET3a is used in the inducible expression system 

15 described by Studier et al. (1990), using the protocol 
described therein. Strain BL21 (DE3) was transformed 
with the pET3a-pilC (300-700) vector, and the transformed 
strain used for the expression of the PilC (300-700) 
peptide . 

20 The expression products after induction were 

analyzed by sodium dodecyl sulfate-polyacrylamide gel 
electrophoresis (PAGE) using an 8% acrylamide gel and the 
standard 25 mM Tris base / 250 mM glycine / pH 8.3 / 0.1% 
sodium dodecyl sulfate (SDS) electrophoresis buffer. 

25 After electrophoresis, the gel was fixed and then stained 
with Coomassie Blue according to standard protocols, and 
the production of PilC(300-700) was confirmed by the 
detection of the presence in the gels of an abundant, 
appropriately- sized peptide of approximately 46kD. 

30 In order to detect the immunogenicity of the 

PilC(300-700) product, the region of the SDS-PAGE gel 
containing the PilC (300-700) polypeptide was excised from 
parallel unstained lanes, homogenized, and the protein 
eluted into a buffer of 0.1% Triton X-100 in water by 
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passive diffusion. Rabbits were given a priming 
intradermal injection of homogenized gel slices 
(containing approximately 500/xg PilC (300 - 700 ) protein), 
followed 3 weeks later by a subcutaneous boost (of 
5 approximately 500 /xg of eluted protein) . An initial test 
serum was then collected after an additional 14 days. 
All of the test animals yielded a specific high titer 
antibody response. The antibodies induced by PilC (300- 
700) were immunologically reactive not only with that 
10 polypeptide (i.e., PilCl (300-700) ) , but also with native 
PilCl and native PilC2. 

The results demonstrate, inter alia, the 
following. PilC contains antigenic epitopes that can 
elicit a strong immunogenic response. At least some of 
15 the immunogenic epitopes are shared (cross-reactive) 

between PilCl and PilC2, despite differences in primary 
amino acid sequence. The technique of subcloning 
discrete portions of the PilC protein under control of an 
inducible promoter allows mapping of antigenic epitopes. 
Sufficient quantities of specific oligopeptides of known 
antigenicity can be produced for use in screening the in 
vivo immune response after exposure to the intact 
pathogen. 

The following listed materials are on deposit 
25 under the terms of the Budapest Treaty with the American 
Type Culture Collection (ATCC) , 12301 Parklawn Dr., 
Rockville, Maryland 20852, and have been assigned the 
following Accession Numbers. 

30 Description ATCC No. Deposit Date 

PABJ03 in E. coli (DH5) 68519 Jan. 28, 1991 

PABJ04 in E. coli (DH5) 68520 Jan. 28 , 1991 
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Upon allowance and issuance of this application 
as a United States Patent, all restriction on 
availability of these deposits will be irrevocably 
removed; and access to the designated deposits will be 
5 available during pendency of the above-named application 
to one determined by the Commissioner to be entitled 
thereto under 3 7 CFR 1.14 and 35 USC 1.22. Moreover, the 
designated deposits will be maintained for a period of 
thirty (30) years from the date of deposit, or for five 

10 (5) years after the last request for the deposit; or for 
the enforceable life of the U.S. patent, whichever is 
longer. The deposited materials mentioned herein are 
intended for convenience only, and are not required to 
practice the present invention in view of the 

15 descriptions herein, and in addition these materials are 
incorporated herein by reference. 

Industrial Applicability 
The invention, in the various manifestations. 

2 0 disclosed herein, has many industrial uses, some of which 
are the following. The pile DNAs may be used for the 
design of probes for the detection of pile nucleic acids 
in samples. The probes derived from the DNAs may be used 
to detect pile nucleic acids in, for example, chemical 

25 synthetic reactions. The polynucleotide probes are also 
useful in detecting viral nucleic acids in humans, and 
thus, may serve as a basis for diagnosis of pathogenic 
microorganisms containing type 4 pilin, for example, 
gonococcal and/or meningococcal infections in humans. 

30 In addition to the above, the DNAs provided 

herein provide information and a means for synthesizing 
polypeptides containing epitopes of PilC. These 
polypeptides are useful in detecting antibodies to PilC 
antigens. A series of immunoassays the relevant 
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neisserial infection, based on recombinant polypeptides 
containing pile epitopes are described herein, and will 
find commercial use in diagnosing diseases caused by 
these microorganisms, in addition, the polypeptides 
derived from the pile DNAs disclosed herein will have 
utility as vaccines for treatment of infections caused by 
meningococci and gonococci. 

The polypeptides derived from the pile DNAs, 
besides the above stated uses, are also useful for 
raising anti-Pile antibodies. Thus, they may be used in 
vaccines against the relevant microorganisms. Moreover, 
the antibodies produced as a result of immunization with 
the polypeptides containing an immuno reactive Pile 
epitope are also useful as passive vaccines, or in the 
detection of the presence of Pile antigens in samples. 
Thus, they may be used to assay the production of 
polypeptides derived from Pile in chemical systems. The 
anti-Pile antibodies may also be used to monitor the 
efficacy of anti- neisserial agents in screening programs 
where these agents are tested in tissue culture systems. 
Another important use for anti -Pile antibodies is in 
affinity chromatography for the purification of Pile 
derived polypeptides. The purified Pile polypeptide 
preparations may be used in vaccines. 
25 For convenience, the anti -Pile antibodies and 

polypeptides containing regions encoded in pile, whether 
natural or recombinant, may be packaged into kits. 
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CLAIMS 

1. A recombinant polynucleotide encoding a 
polypeptide comprised of an immunoreactive epitope of a 
protein encoded in pile of Neisseria. 

2. The recombinant polynucleotide of claim 1, 
wherein the immunoreactive epitope is encoded in pile of 
Neisseria gonorrheae. 

3. The recombinant polynucleotide of claim 2, 
wherein the immunoreactive epitope is encoded in pilC2 of 
Neisseria gonorrheae. 

!5 4. The recombinant polynucleotide of claim 2, 

wherein the immunoreactive epitope is encoded in pilCl of 
Neisseria gonorrheae. 

5. A vector comprised of a recombinant 
2 0 polynucleotide, wherein the recombinant polynucleotide is 
selected from the group consisting of the recombinant 
polynucleotide of claim 1, the recombinant polynucleotide 
of claim 2, the recombinant polynucleotide of claim 3, 
and the recombinant polynucleotide of claim 4. 



25 



30 



A host cell transformed with the vector of 



claim 5 



7. A recombinant expression system comprising 
a polynucleotide encoding a polypeptide comprised of an 
immunoreactive epitope of a protein encoded in pile of 
Neisseria, wherein the polynucleotide is operably linked 
to a control sequence compatible with a desired host. 
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8. 



The recombinant expression system of claim 



7, wherein the immunoreactive epitope is encoded in pile 
of Neisseria gonorrheae. 

5 9 . The recombinant expression system of claim 

8 f wherein the immunoreactive epitope is encoded in pilCl 
of Neisseria gonorrheae. 

10. The recombinant expression system of claim 
10 8, wherein the immunoreactive epitope is encoded in pilC2 

of Neisseria gronorxheae . 

11. A cell transformed with a recombinant 
expression system, wherein the expression system is 

15 selected from the recombinant expression system of claim 
7, the recombinant expression system of claim 8, the 
recombinant expression system of claim 9, and the 
recombinant expression system of claim 10. 

20 12. A polypeptide produced by the cell of 

claim 11. 

13. A purified polypeptide comprised of an 
immunoreactive epitope of a protein encoded in pile of 

25 Neisseria. 

14. A purified polypeptide according to claim 
13, wherein the immunoreactive epitope is encoded in pile 
of Neisseria gonorrheae. 



30 



15. A purified polypeptide according to claim 
14, wherein the immunoreactive epitope is encoded in 
pilC2 of Neisseria gonorrheae. 
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16. A purified polypeptide according to claim 
14, wherein the immunoreactive epitope is encoded in 
pilCl of Neisseria gonorrheae. 

17. A recombinant polypeptide comprised of an 
immunoreactive epitope of a protein encoded in pile of 
Neisseria.. 

18. A recombinant polypeptide according to 
claim 17 , wherein the immunoreactive epitope is encoded 
in pile of Neisseria gonorrheae. 



19. A recombinant polypeptide according to 
claim 18, wherein the immunoreactive epitope is encoded 

15 in pilC2 of Neisseria gonorrheae. 

20. A recombinant polypeptide according to 
claim 18, wherein the immunoreactive epitope is encoded 
in pilCl of Neisseria gonorrheae. 



35 



21. A method of preparing a recombinant 
polypeptide comprised of an immunoreactive epitope of a 
protein encoded in pile of Neisseria, the method 
comprising: 

a. providing a host cell according to claim 

11; 

b. incubating the cell under conditions which 
allow expression of the recombinant polypeptide; and 

c. isolating the polypeptide. 

22. A vaccine composition for the treatment of 
Neisseria infection, comprised of a pharmaceutical^ 
acceptable excipient and of an effective amount of a 
recombinant polypeptide, wherein the polypeptide is 
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comprised of an immunoreactive epitope of a protein 
encoded in pile of Neisseria. 

23 . The vaccine composition of claim 22 , 
wherein the immunoreactive epitope is encoded in pile of 
Neisseria gonorrheae . 

24. a polypeptide affixed to a solid 
substrate, wherein the polypeptide is selected from the 
group consisting of the polypeptide of claim 12, the 
polypeptide of claim 13, the polypeptide of claim 14, the 
polypeptide of claim 15, the polypeptide of claim 16, the 
polypeptide of claim 17, the polypeptide of claim 18, the 
polypeptide of claim 19, and the polypeptide of claim 20. 



25. An immunoassay for detection of anti- 
Neisseria antibodies comprising: 

(a) providing a sample suspected of containing 
ant i -Neisseria antibodies ; 

20 (b) Providing an antigen, wherein the antigen 

is a polypeptide selected from the group consisting of 
the polypeptide of claim 12. the polypeptide of claim 13, 
the polypeptide of claim 14, the polypeptide of claim is, 
the polypeptide of claim 16, the polypeptide of claim 17, 
the polypeptide of claim 18, the polypeptide of claim 19, 
the polypeptide of claim 20; and 

(c) incubating the sample of (a) with the 
antigen of (b) under conditions which allow the formation 
of antibody -antigen complexes; and 
30 (d) detecting the presence of ant i -Neisseria 

antibody-antigen complexes formed in (c) , if any. 
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26. A composition comprised of a polypeptide, 
wherein the polypeptide is selected from the group 
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consisting of the polypeptide of claim 17, the 
polypeptide of claim 18, the polypeptide of claim 19, and 
the polypeptide of claim 20. 

27 . A composition comprised of purified 
polyclonal anti-Pile antibodies, wherein the Pile is of 
Neisseria. 

28. A composition comprised of a monoclonal 
antibody directed against an immunoreactive epitope 
encoded in pile of Neisseria. 



29 . An immunoassay for detection of an antigen 
encoded in pile of Neisseria comprising: 

15 (a> providing a sample suspected of containing 

an antigen encoded in pile of Neisseria; 

(b) providing a composition comprised of 
antibodies directed against the antigen encoded in pile 
of Neisseria, wherein the composition is selected from 
the group of compositions of claim 27 and claim 28; 

(c) reacting the sample of (a) with the 
antibody containing composition of (b) under conditions 
which allow the formation of anti-PilC antibody- antigen 
complexes; 

25 (d) detecting anti-PilC antibody -antigen 

complexes formed in (c) , if any. 

30. A kit for analyzing samples for the 
presence of anti-Pile antibodies comprising: 

(a) an antigen packaged in a suitable 
container, wherein the antigen is a polypeptide selected 
from the group consisting of the polypeptide of claim 12, 
the polypeptide of claim 13, the polypeptide of claim 14, 
^ the polypeptide of claim 15, the polypeptide of claim 16, 
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the polypeptide of claim 17, the polypeptide of claim 18 , 
the polypeptide of claim 19, and the polypeptide of claim 
20; 

(b) a buffer used in the performance of the 
5 analysis, packaged in a suitable container; and 

(c) instructions on the performance of the 
analysis which uses the antigen of (a) and the buffer of 
(b) . 

10 31. A kit for analyzing samples for the 

presence of an antigen comprised of an immunoreactive 
epitope encoded in pile of Neisseria comprising: 

(a) a composition comprised of antibodies 
directed against the antigen comprised of an 

15 immunoreactive epitope encoded in pile of Neisseria, 
wherein the composition is selected from the group of 
compositions of claim 27 and claim 28, wherein the 
composition is packaged in a suitable container; 

(b) a buffer used in the performance of the 
analysis, packaged in a suitable container; and 

(c) instructions for performing the analysis. 



20 



32. A method for producing antibodies to Pile 
of Neisseria comprising administering to an individual a 
25 composition comprised of an isolated immunogenic 

polypeptide containing a Pile epitope in an amount 
sufficient to produce an immune response to the Pile 
epitope. 

30 33. An oligomer capable of hybridizing to a 

sequence in pile of Neisseria, wherein the oligomer is 
comprised of a pile sequence complementary to at least 
about 6 contiguous nucleotides of pile. 
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34. An oligomer according to claim 33, wherein 
pilC is of Neisseria, gonorrheae* 

35. A process for detecting a pile sequence in 
5 an analyte strand, wherein the pile sequence comprises a 

selected target region, the process comprising: 

(a) providing a sample comprised of an analyte 
strand suspected of containing a selected target pile 
sequence; 

10 (b) providing an oligomer capable of 

hybridizing to the target pilC sequence, wherein the 
oligomer is comprised of a pilC targeting sequence 
complementary to at least about 6 contiguous nucleotides 
of pilC; 

15 (c) incubating the sample of (a) with the 

oligomer of (b) under conditions which allow specific 
hybrid duplexes to form between the targeting sequence 
and the target sequence; and 

(d) detecting hybrids formed between the 

20 target sequence, if any, and the oligomer. 

36. The process of claim 3 5 which further 

comprises: 

(a) providing a set of oligomers which are 

25 primers for a polymerase chain reaction (PCR) method and 
which flank the target region; and 

(b) amplifying the target region via the PCR 

method. 

30 37. A kit for detecting a pilC sequence in an 

analyte strand comprising: 

(a) the oligomer of claim 33, packaged in a 
suitable container; 
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(b) a suitable buffer, packaged in a suitable 
container; and 

(c) instructions for performing the detection. 

38. A recombinant polynucleotide comprising a 
DNA sequence of at least 8 contiguous nucleotides from 
pile, wherein the pile sequence is selected from the 
group of sequences shown in Figure 3, Figure 6, and 
Figure 7. 



39. A method of treating an individual for a 
Neisseria infection comprising administering to the 
individual antibodies produced according to claim 32, 
wherein the antibodies are administered in an amount 

15 effective to prevent the pathology of the infection. 

40. An immunoassay for detection of anti- 
Neisseria antibodies comprising: 

(a) providing a sample suspected of containing 
20 anti-Neisseria antibodies; 

(b) providing an antigen, wherein the antigen 
is the polypeptide of claim 24; 

(c) incubating the sample of (a) with the 
antigen of (b) under conditions which allow the formation 

25 of antibody-antigen complexes; and 

(d) detecting the presence of anti-Weisseria 
antibody-antigen complexes formed in (c) , if any. 

41. A kit for analyzing samples for the 
30 presence of anti-Pile antibodies comprising: 

(a) an antigen packaged in a suitable 
container, wherein the antigen is a polypeptide from 
claim 24; 
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(b) a buffer used in the performance of the 



anaylsis, packaged in a suitable container; and 



(c) instructions on the performance of the 



analysis which uses the antigen of (a) and the buffer of 



5 (b) . 
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msii FIG. 6a 

y J (Piic*) 

la CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGGC GGGGCGnnnnr.n 
lb CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGG CGGGGGGGGr.r.G 

2a CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGGC GGGGGGGGGGGGn 
His Thr Ala LeuTyr Ala Ala lie Leu Met Phe SerHis ThrGly Gly Gly Gly GlyAla 

2b CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGG CGGGGGGGGGGGG 

UMOl 

i (PilC ♦) 

lC CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGGC GGGGGGGGGGGGn 
His Thr Ala LeuTyr Ala Ala I le Leu Met Phe SerHis ThrGly Gly Gly Gly GlyAla 

la CATACCGCGCTTTATGCCGCCATCTTG ATGTTTTCCCATACCGGC GGGGGGGGnnnG 
lb CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGGC GGGGGGGGnGG 

765 

(PHC*) 

33 CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGG CGGGGGGGGGGnGG 
His Thr Ala LeuTyr Ala Ala I le Leu Met Phe SerHis ThrGly Gly Gly Gly GlyAla 

3b CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGG CGGGGGGGGGGGGnr. 
4 CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGG CGGGGGGGGGGGC An 

605103 

! (PilC) 

la CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGGC GGGGGGGGGGGG 
lb CATACCGCGCTTTATGCCGCCATCTTG ATGTTTTCCCATACCGGC GGGGGGGGGG 

pABJ04 

lC CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGGC GGGGGGGGGnnr.r; 
His Thr Ala LeuTyr Ala Ala I le Leu Met Phe SerHis ThrGly Gly Gly Gly GlyAla 

la CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGGC GGGGGGGGGGGG 

lb 

CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGGCGGGGGGGGGGG 

Id CATACCGCGCTTTATGCCGCCATCTTGATGTTTTCCCATACCGGC GGGGGGGGnn 
His Thr Ala LeuTyr Ala Ala lie Leu Met Phe SerHis ThrGly Gly Gly GlyAla 
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FIG. 6b Sequenced 

2 35 G-stretch clones 

CGATGGCGCA A ACCCATCA ATACGCTATTATCATGA ACG AGCG A 12 G H 

CG ATGGCGCA A ACCCATCA ATACGCTATTATCATG A ACG AGCGA HQ 6 

CGCAGGCGC A A ACCCGTA A ATACGCTATTATCATGA ACG AGCG A 13 C 2 
Gin Ala Gin Thr Arg Lys Tyr Ala lie I le Met Asn Glu Arg 
fl 2 3 4 5 5 7 8 9 10 SI 12 

CGCAGGCGCA A ACCCGTA A ATACGCTATTATCATG A ACG AGCG A 12 G 1 

CG ATGGCGCA A ACCCATCA AT ACGCTATTATCATG A ACG AGCG A 13 G 7 
Met Ala Gin Tnr His Gin Tyr Ala lie Me Met Asn Glu Arc 

CG ATGGCGCA A ACCCATCA ATACGCTATTATCATG A ACG AGCG A 12 G 1 

CGATGGCGCA A ACCCATCA ATACGCTATTATCATGA ACG AGCG A 11 G 2 

CGATGGCGCA AACGTATAAATACGCTATTGTGATG AACGAGCGA 13 G 1 
Met Ala^Gln Thr Tyr Lys Tyr Ala I le Val Met Asn Glu Arg 

CGATGGCGCAA ACGTATAAATACGCTATTGTGATG AACGAGCG A 14 G 5 

GCGCAGGCGC A AACGTATAAATACGCTATTGTGATG AACGAGCGA 11 G 3 

CGATGGCGCA A ACCCATCA AT ACGCTATTATCATG A ACG AGCG A 12 G 1 

CG ATGGCGC A A ACCCATCA AT ACGCTATTATCATG A ACG AGCG A HQ 8 

CGATGGCGCAAACCCATCAATACGCTATTATCATGA ACG AGCGA 13 G 2 
Met Ala^ln Thr His Gin Tyr Ala lie lie Met Asn Glu Arg 

CG ATGGCGC A A ACCCATC A AT ACGCTATTATCATG A ACG AGCG A 12 G ? 

CG ATGGCGCA A ACCCATCA AT ACGCTATTATCATG A ACG AGCG A H Q 1 

CGATGGCGCAA ACCCATC A AT ACGCTATTATCATG A ACG AGCG A inp 2 
Met Ala Gin Thr His Gin Tyr Ala lie lie Met Asn Glu Arg 
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